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PHENOTYPE AND BIOLOGICAL MARKER IDENTIFICATION SYSTEM 

<;rnPF. of the invention 

5 The present invention provides a phenotype and biological marker identification 

system and methods for identifying and using novel patterns of biological markers related 
to disease, disease progression, response to therapy and normal biological functions. The 
discovery and use of novel patterns of biological markers will result in more cost-effective 
drug development, including the improvement of patient selection in clinical trials and the 

1 0 identification of therapeutics with greatly improved safety and efficacy. Phenotype 
information and biological markers can also be used in diagnostic applications. 



RArK-cRnirND OF the invention 

As a result of recent innovations in drug discovery, including genomics, 
1 5 combinatorial chemistry and high throughput screening, the number of drug candidates 
available for clinical testing exceeds the phaimaceutical industry's development and 
economic capacity. In 1998, the world's top pharmaceutical and biotechnology companies 
spent more than $50 billion on research and development, more than one-third of which 
was spent directly on clinical development. As the result of a number of factors, including 
20 increased competition and pressure from managed care organizations and other payors, the 
pharmaceutical industry is seeking to increase the quality, including the safety and efficacy 
of new drugs brought to market, and to improve the efficiency of clinical development. 

Recent drug discovery innovations, therefore, have contributed to a clinical tnals 
bottleneck. The numbers of therapeutic targets being identified and lead compounds being 
25 generated far exceed the capacity of pharmaceutical companies to conduct clinical trials as 
they are currently performed. Further, as the industry currently estimates that the average 
cost of developing a new drug is approximately $500 million, it is prohibitively expensive 
to develop all of the potential drug candidates. 

The pharmaceutical industry is being forced to seek equivalent technological 
30 improvements in drug development. Clinical trials remain very expensive and very risky, 
and often decision making is based on highly subjective analyses. As a result, it is often 
difficult to determine the patient population for whom a drug is most effective, the 
appropriate dose for a given drug and the potential for side effects associated with its use. 
Not only does this lead to more failures in clinical development, it can also lead to 
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approved products that may be inappropriately'dosed, prescribed, or cause dangerous side 
effects. With an increasing number of drugs in their pipelines, pharmaceutical companies 
require tecluiologies lo identify objective measurements of a drug candidate's safety and 'J 
efficacy profile earlier in the drug development process. 

5 One approach to deal with the mass of information and technologies is to break 

away from the traditional methods of drug identification and development. As a vanety of 
different analytical, clinical and information handling technologies continually advance, it 
may be possible to develop a phenotype for an individual or population that allows for an 
unprecedented systematic evaluation of such individual or population. The phenotype for a 

10 given individual includes, in theory, all measurable characteristics of such individual at all 
points in time. One use of such phenotype information is the identification of biological 
markers. 

Biological markers are characteristics that when measured or evaluated have, inter 
alia, a discrete relationship or correlation as an indicator of normal biologic processes, 
15 pathogenic processes or pharmacologic responses to a therapeutic intervention. 

Pharmacologic responses to therapeutic inten-ention include, but are not limited to, 

response to the intervention generally (e.g., efficacy), dose response to the intervention. 

side effect profiles ofthe intervention, and pharmacokinetic properties. Response may be J 

correlated with either efficacious or adverse (e.g.. toxic) changes. Biological markers 
20 include patterns of cells or molecules that change in association with a pathological process 
and have diagnostic and/or prognostic value. Biological markers may include levels of cell 
populations and their associated molecules, levels of soluble factors, levels of other 
molecules, genotypic information, gene expression levels, genetic mutations, and clinical 
parameters that can be correlated with the presence and/or progression of disease. 
25 In contrast to such clinical endpoints as disease progression or recurrence or quality 

of life measures (which typically take a long time to assess), biological markers may 
provide a more rapid and quantitative measurement of a drug's clinical profile. Single 
biological markers currently used in both clinical practice and drug development include 
cholesterol, prostate specific amigen ("PSA"). CD4 T cells and viral RNA. Unlike the well 
30 known correlation between high cholesterol and heart disease. PSA and prostate cancer, 
and decreased CD4 positive T cells and viral R.NA in AIDS, the biological markers 
correlated with most other diseases have yet lo be identified. As a result, although both 
government agencies and phamiaceutical companies are increasingly seeking development 
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of biological markers for use in clinical trials, the use of biological markers in drug 
development has been limited to date. 

Although there are many potential biological markers, there is limited technology 
that is capable of sorting through the vast amounts of information needed to establish the 
correlation of the biological markers with normal biologic processes, disease, disease 
progression and response to therapy. Phenotyping requires the instrumentation and assays 
required to measure hundreds to thousands of parameters, an infonnatics system to allow 
this data to be easily accessed, software to correlate the patterns of information with 
clinical data and the ability to utilize the resulting information in the drug development 
process. The present invention provides such a technology. 

fsIIMMARY OF THF INVENTION 

The present invention relates to phenotyping an organism oraclassor subclass of 
organisms. The present invention also includes the identification of biological markers that 
are measured and evaluated as an indicator of normal biologic processes, pathogenic 
processes or phamiacologic responses to a therapeutic intervention. This invention 
includes technology capable of providing quantitative, sensitive reproducible and rapid 
measurements of multiple and diverse biological markers that could accurately profile an 
organism's phenotype or a patient's disease status and response to therapy. Further, 
because blood is the single most infomnation rich tissue and is easily and readily accessible 
for testing, the invention focuses on identifying biological parameters from small samples 
of blood. The invention includes a multidisciplinary format comprising three principal 
elements: instrumentation, assay development and clinical informatics. 

RRTRF DESCRIPTION OF THH FIGURES 

Figure 1 is a schematic representation of the types of information that are 
assimilated to obtain one embodiment of a biological marker identification system. 

Figure 2 depicts a schematic representation of the improved MLSC instrument of 
the invention (term "SurroScan" instrumentation). 

Figure 3 depicts the integrated information infrastructure for analyzing the data 

obtained in the present invention. 
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Figures 4 A - C depict the results obtained in Example 1 showing that CD27" and 
CD27' CDS T cells vary among samples. Blood samples from three different donors 
(Figures 4A, B, and C) were stained with CyS anti-CD27 and Cy5.5 anti-CDS. 

Fiaures 5 A and B depict robust cellular measurements with 2-color MLSC. Figure 
5 5A demonstrates the consistency of CDS T cell counts from 6 different capillanes. Cy5.5 
anti-CDS was combined with a different CyS conjugated antibody for each of the 
capillaries (anti-CD3, CD25, CD7, CD45RA, CD62L, CD69)- Fifty different blood 
samples were analyzed. The box-and-whiskers plots show that the distributions of cell 
counts are very similar for each of the capillaries. Pair-wise linear regression also shows a 
10 high degree of consistency for these assays (data not shown). Figure 5 shows the 

consistency of two measures of B cells, one with CyS. 5 anti-CD20 and one with CyS.S 
anti-CDl9. The 95% confidence interval (dotted line) of the linear regression includes a 
slope of 1 and the fit has a correlation coefficient of 0.97. 

Figure 6 shows a classification matrix comparing CDS T cells and CD4 T cells m 
1 S RA patient samples and blood bank samples. 

Figures 7 A and B show results of a three color cellular assay on the SurroScan 

instrument. 

Figures 8 A - C shows the results of staining intracellular molecule as measured 

with MLSC technology. 
20 Figures 9A - C show the results of a 3 detection channel analysis using MLSC 

technology. 

DRT AILED DESCRIPTION OF THF IKVENTION 

The present invention is directed to the phenotyping of an organism or a class or 

25 subclass of organisms. In theory, the phenotyping of an organism includes obtaining all 
measurable characteristics of said individual, past and present. While the complete 
phenotyping of any organism is not practical or even possible, the phenotyping disclosed 
and described herein provides an unprecedented quantity of an unprecedented number of 
types of parameters or characteristics so as to provide a resource of information that wjll 

30 allow for the analysis of normal biological functions, disease, disease progression and 
chanoes associated with virtually any perturbation to the organism. 

One utility of the phenotyping system taught by the present invention is to the 
identification of biological markers for normal biological processes, diseases or medical 
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conditions. In order .0 perform th,s aspect of the present invention it is necessan. to have i) 
biological infomiation from a population of individuals, ii) an adequate amount of data 
3 from each individual, preferably obtained by multiple sampling over time, and ii.) an 

information storage and retneval system that a) can imegraiively incorporate a wide vanety 
5 of types of infonTiation and b) can perform meaningful correlation analysis of the disparate 
types of data. Figure 1 depicts information that is useful to create a biological marker 

identification system. 

Disease and disease progression involves the complex interplay of both genetic and 

environmental factors. The present invention has the potemial to identify and trace 
10 changes in patterns of biological markers reflecting both genetic and environmental factors 
from small samples of blood. Furthemiore, the present invention helps decipher genetic 
components of disease susceptibility, disease progression and response to therapy. 

The present invention is capable of monitoring cells, proteins, organic molecules, 
genotype, soluble factors, clinical and environmental factors, all of which have been used 
15 as biological markers in drug development and as disease markers. Examples of known 
biological marker include the monitoring for decreases in CD4 positive T cells and viral 
RNA levels in AIDS, elevated cholesterol levels as an accepted biological marker for heart 
disease and changing levels of PSA as a protein marker found in the blood of prostate 
cancer patients. 

20 . Since the biological characteristics or parameters that might be discovered to be a 

biological marker or part of a marker "grouping" are often not predictable, it ,s essential 
that the appropriate database contain information regarding as many parameters as 
possible. 

The present invention extends to a phenotype of a given organism, methods for 
25 assembling such phenotype and methods for utilizing such phenotype. The phenotype of 
an organism or class or subclass or organisms comprises a large compilation of data 
relating to the organism or class or subclass of organisms. The novel aspect of the present 
invention lies in the disparate nature of the data and the quamity of data from each of the 
various categories of data available on an organism. A phenotype can only reach its full 
',0 usefulness if the data defining the phenotype is extensive. For example, a phenotype for a 
human patient containing a standard blood profile and clinical factors routinely obtained 
from a physical examination can not provide enough information to fully exploit such a 
phenotype. Although the assays involved and data obtained are within the scientific and 
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clinical capabilities of the art, obtaining all of the information from a single organism is a 
novel task. Although the handling and maintenance of a phenotype lends itself to 
computerization, a given phenotype can be kept in traditional formats. Manipulating 
phenotyTjes to identify a biological marker or to observe the effect of a perturbation in the 
5 organism is, of course, greatly simplified by the use of computational analysis via a 
computer. As described above, the complete phenotyping of an organism would include 
literally thousands or possibly millions of data points. In the preferred aspects of this 
invention, a phenotype compnses greater than 40 biological parameters, more preferably 
greater than 100 parameters, and most preferably greater than 200 different parameters, and 
1 0 in some cases greater than 300 different parameters. The phenotype must contain 

biological parameters that include infonmation from cellular assays, soluble factor assays 
and clinical information. In the preferred embodiment, the results of at least 20 cellular 
assays incorporating measurements of at least 20 cell populations and/or cell associated 
molecules and the results of at least 20 soluble factor assays are included in the phenotype. 
1 5 along with clinical information. In more preferred embodiments, the results of at least 40 
cellular assays incorporating measurements of at least 40 cell populations and/or cell 
associated molecules and at least 40 soluble factor assays are included, preferably with an 
extensive battery of clinical and environmental parameters. In preferred embodiments of ' 
the invention there are included more than 20 clinical parameters, preferably more than 40 
20 and in some cases more than 60 clinical parameters. 

A rich and readily accessible source of biological information for a patient is the 
blood. At the present time, there are over 200 identified discrete leukocyte cell surface 
antigens with identified antibodies. In addition, there are literally thousands of proteins 
and other soluble factors and small molecules that can be identified in blood. The problem. 
25 therefore, is not in finding enough infomational content in the blood, but in efficiently 
extracting all of the available information from limited quantities of blood. 

Many levels of biological markers may vary widely from individual to individual. 
In many cases, such variations may be random, but this may not always be the case. For 
example, in some situations baseline levels may be individual specific, and only by taking 
30 multiple readings from an individual would it be possible to identify a biological marker. 
Although it may not be likely that a baseline would be established for a healthy individual, 
there may be valuable information gained from the variations over time in a given 
individual that has a disease or medical condition. For example, a patient with rheumatoid 
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arthnlis mav show interesting vanations when off or on medicine, or when exhibiting a 
severe flare-up of symptoms. If such longitudinal correlations exist, review of the 
longitudinal data of other similarly situated patients could confirm valuable biological 
markers associated with the disease. \Vhen longitudinal data over an extended period of 
5 time exists, the number of individuals necessary- for the analysis to be statistically 

significant can be relatively small. 

An additional application of the present invention is in monitoring dose response 
studies. In this embodiment, a population of individuals is evaluated before and after the 
administration of drug and after increasing doses of the drug. In this embodiment, the 
10 selected population may be healthy individuals, and the anticipated biological dose 

response endpoint is toxicity or side effect profiles. In embodiments where the individuals 
have a particular disease or medical condition, markers may be identified for efficacy along 
with the negative effects of the drug. By evaluating the infomiation from individuals 
before and after administration of dnigs it will be possible to identify markers or marker 
1 5 groupings associated with administration and response to the drug. In some situations. 

such markers could be used as an endpoint for clinical studies. For example, in contrast to 
such clinical endpoints as disease progression or recurrence or quality of life measures 
(which typically take a long time to assess), biological markers may provide a more rapid 
and quantitative measurement of a drug's clinical profile. 
20 In other embodiments of the present invention, longitudinal studies of individuals 

receiving a drug or treatment for the prevention or treatment of a disease or medical 
condition could constitute the population of individuals being evaluated. By correlating 
biological indicators of individuals before they receive treatment with subsequent clinical 
observations, it will be possible to identify biological markers associated with those 
25 members of a potential patient population that will most benefit from the treatmem therapy 
In such a mamier, expensive treatments can be limited to the subpopulation of patients 

most likely to benefit from the treatment. 

Another application of the present invemion is the use of biological markers to 
identify patients who have very early clinical signs of a disease. This would be extremely 
30 valuable for a multitude of disease states where a patient may have -'subclinical" signs and 
symptoms which are not severe enough to bring them to a doctor's office. However, if a 
patient had a marker which was discovered in their blood and they were advised to seek 
medical attention, their "subclinical" signs could be identified as their earliest phenotypic 
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presentation of a disease. For many diseases, it is extremely advantageous to diagnose a 
disease as early as possible so that therapeutic drugs may be started and generally lead to 
reduced morbidity and mortality of that disease entity for that individual. A possible 
scenario would be if a patient could take a blood test to see if they have a biological marker 
5 for Rheumatoid Arthritis. If the marker were present, they could then seek treatment 
during the '-subclinical'- stage where they may only have a sensation of warmth in their 
joints'instead of waiting until they have joint pain, swelling and deformity. That individual 
would likely have a much better long-temi outcome for Rhfeumatoid Arthritis in 
comparison to someone who waits until they have a much later stage of the disease before 

10 seeking treatment. 

The present invention is directed to the phenotyping of an organism or class or 
subclass of organisms. The phenotype is made up of data from a large number of data 
categories. The principle categories of data included within the scope of this invention are 
i) levels of cell populations including their cell associated molecules in biological Quid, ii) 
1 5 levels of soluble factors in the biological fluid, iii) drug dosing and pharmacokinetics 
(measurement of a drug and its metabolites in a body) and iv) clinical parameters. 
Additional categories of data may include, but are not limited to. i) levels of small 
molecule compounds in biological fluid, ii) genotype information regarding the individual, 
including the individual's genetic makeup and gene expression (mRNA or transcripts) 
20 levels, and iii) data obtained from assays of urine components. In certain embodiments 
data categories may include images such as x-ray, CAT scans of the brain or body, or 
MRIs, or information obtained from biopsies, EKGs, stress tests, endoscopies, ultrasound 
exams, laparascopic procedure, orthroscopic surgeries. PET scans, or any other 
measurement of an individual's condition. 
25 In the preferred embodiment, the clinical parameters included in the database of the 

present invention would include, but not be limited to. the individual's age, gender, weight, 
height, body type, medical history (including comorbidities, medication, etc.), 
manifestations and categorization of disease or medical condition (if any) and other 
standard clinical obsei^ations made by a physician. Also included among the clinical 
30 parameters would be environmental and family history factors. 

Clinical parameters could be further characterized by the source from which the 
infomiation which is obtained. Patient obtained clinical parameters may include 
information that the patient provides via a questionnaire such as the WOMAC for 



BN50OCID <WO.. 0055A72At L> 



wo 00/65472 



PCT/USOO/11296 



9 



ti'''" 



osteoanhritis. and the Health Assessment Questionnaire for Rheumatoid Arthritis which 
may be filled out in a doctor's office. Similarly, electronic or web-based questionnaires 
) addressing all of a patient "s current clinical symptoms could be completed by the patient 

prior to a clinic visit. Information obtained by a nurse would include vital signs, 
5 information from a variety of tests including allergy testing, pulmonary function testing, 
stress-thallium testing, or ECG tests. Clinical parameters collected from a physician 
includes a detailed history of prior illnesses, surgeries, hospitalizations, medications, 
reactions to medications, family history, social history, alcohol/drug/smoking history, as 
well as other behavior which would put a patient at high risk for HIV or Hepatitis. A 
1 0 thorough physical exam is also perfomied by a clinician and is a crucial component of a 

patient's clinical parameters. 

In the preferred embodiment, the levels of cell populations and their associated 
molecules are identified by microvolume laser scanning cytometry. Such data can also be 
obtained by flow cytometry, but the volume of blood necessary to perform the flow 
1 5 cytometry assays places a serious limit on the number of assays that can be performed on 
blood taken from a given individual at one time. In addition, the sample preparation 
required for performing flow cytometry assays is time consuming, expensive, and may 
interfere with the measurement result. 

The levels of soluble factors can be measured by any suitable technique. In the 
20 preferred embodiment, the levels of soluble factors is measured by standard immunoassay 
techniques, such as ELISA techniques. In an alternative embodiment, microvolume laser 
scanning cytometry is used to obtain levels of soluble factors. Soluble factors can be 
detected by immunoassays such as MLSC. ELISA, etc.. mass spectrometry, 2D gel 
electrophoresis, combinations of mass spectrometry and immunosorption, and chemical 
25 assays. In the preferred embodiment, cell populations are detected by MLSC assays and 
soluble factors are detected by immunoassays or mass spectrometry. 

The invention includes improved instrumentation for the rapid, reproducible and 
quantitative evaluation of biological parameters from a small quantity of blood; 
miniaturized, high sensitivity assays compatible with improved instrumentation for the 
30 detection of hundreds to thousands of biological parameters in blood; a broad clinical 
strategy to collect extensive medical information content from patients who are followed 
over time; software, databases and data mining tools to correlate patterns of parameters 
with normal biological functions, specific diseases, disease progression and response to 
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therapy; databases of clinical data and biological markers in collaboration with academic 
centers and clinical research institutes for use in drug development; development of 
diagnostic tests using proprietary patterns of markers and the ability to improve the 
efficiency of drug dcN elopment by enabling more informed decisions in choosing lead 
5 compounds and identifying patients more likely to benefit from a given therapy. 

The unique ability to phenotype an organism and to conduct reproducible and rapid 
measurements of large numbers of biological parameters is essential for the present 
invention to identify novel patterns of biological markers from small samples of blood. 
Statistical analyses to date have shown that the assays for the numbers of different cell 
10 subsets or cell populations, are quantitative and highly reproducible. The present 

technology, which uses small volumes of blood and requires limited handling of patient 
samples, has distinct advantages over other commercially available measurement 
technologies. 

The invention ftirther includes studies of patiem populations related to particular 
15 diseases. These studies are based upon statistical analyses of disease patterns and require 
the collection of large numbers of blood samples from affected individuals. In addition, the 
present invention has utility for phenotyping and identifying biological markers in plants 
and animals and for assisting in preclinical studies. , J 



20 Definitions 

As used herein the term "phenotype" or "phenotyping" refers to a compilation 
composing a substantial subset of all measurable characteristics of an organism. Such 
characteristics or parameters include, but are not limited to, levels of cell populations and 
their associated molecules, levels of soluble factors, levels of other molecules, genotype 
25 information, gene expression levels, genetic mutations, and clinical parameters. Such 

charactensiics or parameters include all historical data and present data. For example, an 
organism's complete phenotype includes all measurable characteristics at the present time, 
as well as all such characteristics at all past points of time. In addition to technically 
measurable characteristics, the phenotype can include an organism's feelings or emotions 
30 (in the case where the organism is a human, the phenotype includes the individual's mental 
state, e.g., depression, pain, agitation, mental illnesses, chemical dependencies); diet and 
changes in diet, injuries, relational histoo'. sexual practices, socio-economic status. 
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An used herein the term "organism" refers to all plants, animals, viruses and 
exoterrestial materials. Included within this definition, but not limited in any way, are 
humans, mice, rats, rabbits, companion animals, natural and genetically engineered plants, 
and natural and genetically engineered animals. 
5 A given phenotype might include a compilation of characteristics of a single 

oroanism or a class or subclass of organisms. For example, the phenotypic data may be 
obtained from a single male individual who has been diagnosed with cancer before and 
after therapeutic intervention, a group of males between the age of 1 5 and 55. or a group of 
males between the ages of 15 and 55 diagnosed with cancer. In this manner, the phenotype 
10 may be specific to a given individual, or may represent the average or typical condition of a 
combined group of individuals. 

The phenotype of an individual organism or group of organisms may be used for a 
variety of purposes. In the broadest scheme of the invention, the phenotype is looked at 
longitudinally and evaluated after some perturbation to the organism. For example, the 
15 comparison of the phenotype of an individual before and after exhibiting symptoms of 
asthma could be used to identify biological markers associated with asthma. In another 
example, the phenotype of an individual who has asthma can be compared with the 
phenotype of a population of normal adults. In another example, the phenotype of a 
naturally occurring plant can be compared with the phenotype of a genetically altered plant 
20 to determine what measurable characteristics are altered by the introduction of the genetic 
alteration. A further example of the use of phenotyipng infonnation would be to 
periodically monitor well-patient status of an individual and to track measures of biological 
aging processes. The potential uses for comprehensive phenotypic data for an organism are 
almost infinite. 

25 The present invention includes phenotypes for an organism or class or subclass of 

organisms, methods for obtaining such phenotypes and methods for utilizing such 
phenotypes. including for the identification of biological markers. 

As used herein the term "biological marker" or "marker" or "biomarker ' means a 
characteristic or parameter that is measured and evaluated as an indicator of normal and 

30 abnormal biologic processes, pathogenic processes or pharmacologic responses to a 

therapeutic intei^-ention. Pharmacologic responses to therapeutic mtervention include, but 
are not limited to. response to the inter%'ention generally (e.g., efficacy), dose response to 
the inter^•emion. side effect profiles of the inten'ention. and pharmacokinetic properties. 
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Response may be correlated with either efficacious or adverse (e.g., toxic) changes. 
Biological markers include patterns or ensembles of cells or molecules that change in 
association with a pathological process and have diagnostic and/or prognostic value. 

Biological markers include, but are not limited to, levels of cell populations and 
their associated molecules, levels of soluble factors, levels of other molecules, gene 
expression levels (mlWA or transcripts), genetic mutations, and clinical parameters that 
can be correlated with the presence and progression of disease, normal biologic processes 
and response to therapy. Single biological markers currently used in both clinical practice 
and drug development include cholesterol, PSA, CD4 T cells, and viral RNA. Unlike the 
well known correlations between high cholesterol and heart disease, PSA and prostate 
cancer, and CD4 positive T cells and viral RMA and AIDS, the biological markers 
correlated with most other diseases have yet to be identified. As a result, although both 
government agencies and pharmaceutical companies are increasingly seeking development 
of biological markers for use in clinical trials, the use of biological markers in drug 

development has been limited to date. 

As a non-limiting example, biological markers are often thought of as having 
discrete relationships with noraial biological status, a disease or medical condition, e.g., 
high cholesterol correlates with an increased risk of heart disease, elevated PSA levels 
correlate with increased risk of prostate cancer, reduced CD4 T cells and increased viral 
RNA coaelate with the presence/progression of AIDS. However, it is quite likely that 
useful markers for a variety of diseases or medical conditions may consist of significantly 
more complex patterns. For example, it could be discovered that lowered levels of one or 
more specific cell surface antigens on specific cell type(s) when found in conjunction with 
elevated levels of one or more soluble factors - - cytokines, perhaps - - is indicative of a 
5 particular auto-immune disease. Therefore, for the purposes of this mvention, a biological 
marker may refer to a pattern of a number of indicators. 

As used herein the terni "biological marker identification system" means a system 
for obtaining information from a patient population and assimilating the infomiation in a 
manner that enables the correlation of the data and the identification of biological markers. 
0 A biological marker identification system comprises an integrated database comprising a 
plurality of data categones, data from a plurality of individuals corresponding to each of 
said data categories, and processing means for correlating data within the data categories, 
wherein correlation analysis of data categones can be made to identify the data category or 
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catecTories where individuals having said disease or medical condition may be 
differentiated from those individuals not having said disease or medical condition, wherein 
said identified category or categories are markers for said disease or medical condition. 
Additionally, markers may be identified by comparing data in various data categories for a 
5 sinole individual at different points of lime, e.g., before and after the administration of a 
drug. 

As used herein the term "data category" means any type of measurement that can be 
discerned about an organism. Examples of data categories useful in the present invention 
include, but are not limited to, numbers and types of cell populations and their associated 
10 molecules in the biological fluid of an organism, numbers and types of soluble factors in 
the biological nuid of an organism, information associated with a clinical parameter of an 
organism, cell volumetric counts per ml of biological fluid of an organism, numbers and 
types of small molecules in the biological fluid of an individual, genomic information 
associated with the DNA of an organism and gene expression levels. For example, a single 
1 5 data category would represent the concentration of IL- 1 in the blood of an organism. 
Additionally, a data category could be the level of a drug or its metabolites in blood or 
urine An additional example of a data category would be absolute CD4 T cell count. The 
number or information assigned to an organism or class or subclass of organisms at any 
given point in time in part comprises the phenotype of that organism. 
20 As used herein the term "biological fluid" means any biological substance, 

including but not limited to. blood (including whole blood, leukocytes prepared by lysis of 
red blood cells, peripheral blood mononuclear cells, plasma, and serum), sputum, urine, 
semen, cerebrospinal fluid, bronchial aspirate, sweat, feces, synovial fluid and whole or 
manipulated tissue. Biological fluid typically contains cells and their associated molecules, 
25 soluble factors, small molecules and other substances. Blood is the preferred biological 
fluid in this invention for a number of reasons. First, it is readily available and can be 
drawn at multiple times. Blood replenishes, in part, from progenitors in the marrow over 
time. Blood is responsive to antigenic challenges and has a memoiy of antigenic 
challenges. Blood is centrally located, recirculates and potentially reports on changes 
30 throughout the body. Blood contains numerous cell populations, including surface 

molecules, internal molecules, and secreted molecules associated with individual cells. 
Blood also contains soluble factors that are both self, such as cytokines, antibodies, acute 
phase proteins, etc., and foreign, such as chemicals and products of infectious diseases. 
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As used herein the term "cell population" means a set of cells with common 
characteristics. The characteristics may include the presence and level of one, ^vo, three 
or more cell associated molecules, size, etc. One, two or more cell associated molecules Q 
can define a cell population. In general some additional cell associated molecules can be 
5 used to ftirther subset a cell population. A cell population is identified at the population 
level and not at the protein level. A cell population can be defined by one. txvo or more 
molecules. Any cell population is a potential marker. 

As used herein the temi "cell associated molecule" means any molecule associated 
with a cell. This includes, but .s not limited to: 1) intrinsic cell surface molecules such as 
1 0 proteins, glycoproteins, lipids, and glycolipids; 2) extrinsic cell surface molecules such as 
cytokines bound to their receptors, immunoglobulin bound to Fc receptors, foreign antigen 
bound to B cell or T cell receptors and auto-antibodies bound to self antigens; 3) intrinsic 
internal molecules such as cytoplasmic proteins, carbohydrates, lipids and mRNA, and 
nuclear protein and DN A (including genomic and somatic nucleic acids); and 4) extrinsic 
1 5 internal molecules such as viral proteins and nucleic acid. The preferred cell associated 
molecule is typically a cell surface protein. As an example, there are hundreds of 
leukocyte cell surface proteins or antigens, including leukocyte differentiation antigens 
(including CD antigens, currently through CD166, see. Leucocyte Typing VI. Kishimoto. _ 
T. sLaL- ED, 1997), antigen receptors (such as the B cell receptor and the T cell receptor). 
20 and major histocompatibility complex. Each of these classes encompass a vast number of 
proteins. A list of exemplary cell surface proteins is provided in Table 1, which is merely 
an illustration of the vast number of cell surface proteins and is in no way intended to be a 
comprehensive list. 

As used herein the term "soluble factor" means any measurable component of a 
25 biological nuid or tissue that is not a cell population or cell associated molecule. Soluble 
factor includes, but is not limited to, soluble proteins, carbohydrates, lipids, lipoproteins, 
steroids, other small molecules, including metallic, inorganic, ionic and metallorganic 
species and complexes of any of the preceding components, e.g., cytokmes and soluble 
receptor; antibodies and antigens; and a drug complexed to an>ahing. Soluble factors can 
30 be both self, such as cytokines, antibodies, acute phase proteins, etc., and foreign, such as 
chemicals, products of infectious diseases and intestinal flora and fauna. Soluble factors 
may be intrinsic, i.e., produced by the organism, or extnnsic such as a vinis, drug or 
environmental toxin. Soluble factors can be small molecule compounds such as 
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prostaglandins, vitamins, metabolites (such as iron, sugars, amino acids, etc.), drugs and 
drug metabolites. A list of exemplary soluble proteins is provided in Table 6, which is 
merely an illustration of the vast number of soluble proteins and is in no way intended to be 

a comprehensive list. 

For the purposes of this invention, soluble factors may be either known or unknown 
entities. A variety of techniques are available where a given species may be identifiable, 
but the chemical identity of the species is unknown. In the present invention, the chemical 
identity of the soluble factor need not be currently known or known at the time the assay is 
performed to determine its presence or absence. 

As used herein the term "small molecule" or "organic molecule" or "small organic 
molecule" means a soluble factor or cell associated factor having a molecular weight in the 
range of 18 to 1 0,000. Small molecules can include, but are not limited to, prostaglandins, 
vitamins, metabolites (such as iron, sugars, amino acids, etc.), drugs and drug metabolites. 

As used herein the term "disease or medical condition" means an interruption, 
cessation, disorder or change of body functions, systems or organs. Examples of disease or 
medical conditions include, but are not limited to. immune and inflammatory conditions, 
cancer, cardiovascular disease, infectious diseases, psychiatric conditions, obesity, and 
other such diseases. By way of illustration, immune and inflammatory conditions include 
autoimmune diseases, which further include rheumatoid arthritis (RA), multiple sclerosis 
(MS), diabetes, etc. 

As used herein the term "perturbation" means an exterior or interior measurable 
event that can occur to an organism. A simple example would be the administration of a 
therapeutic agent to an individual, or an individual that was healthy and then developed 
asthma. In this application a perturbation may also include differences between an 

5 individual or groups of organisms that are being compared. For example, a population of 
animals may be considered to be normal, and their phenotype is being compared to the 
phenotype of a similar but genetically altered animal. The individual genetically altered 
animal was perturbed in the sense that its genetic alteration was perturbed from normal. In 
many cases the perturbation is not a single event that occurs at a discrete point in time. The 

0 perturbation may occur over an extended period of time, and^or may be cyclical or 
intermittent. 

As used herein the term "clinical parameter" means information that is obtained thai 
may be relevant to a disease or medical condition. Such information may be supplied by 
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the patient or by a medical or scientific observer. Examples of clinical parameters for 
humans include, but are not limited to, age, gender, weight, height, body type, medical 
history, ethnicity, family history, genetic factors, environmental factors, manifestation and 
cateoorization of disease or medical condition, and any result of a clinical lab test, such as 

5 blood pressure, MRI, x-ray. etc. 

Clinical parameters could be further characterized by the source of information 
which is obtained. Patient obtained clinical parameters may include information that the 
patient provides via a questionnaire such as the WOMAC for osteoarthritis, and the Health 
Assessment Questionnaire for Rheumatoid Arthritis which may be filled out on paper in a 

10 doctor's office. Similarly, an electronic or web-based questionnaires addressing all of a 

patient's current clinical symptoms could be completed by the patient prior to a clinic visit. 
Infonnation obtained by a nurse would include vital signs, information from a variety of 
tests including allergy testing, pulmonary function testing, stress-thallium testing, or ECG 
tests. Clinical parameters collected from a physician includes a detailed history of prior 

1 5 illnesses, surgeries, hospitalizations, medications, reactions to medications, family history, 
social history, alcohol/dojg/smoking history, as well as other behavior which would put a 
patient at high risk for HIV or Hepatitis. A thorough physical exam is also performed by a 
clinician and is a crucial component of a patient's clinical parameters. 

As used herein the term "genotype information" means any data relating to the 

20 organisms genetic makeup, gene mutations, gene expression, e.g., mRNA or transcription 
levels, and any other measure or parameter associated with the genetic material of the 
organism. 

As used herein the term "clinical endpoint'' means a characteristic or variable that 
measures how a patient feels, functions, or survives. There are several mechanism which 

25 are commonly used to measure how a patient feels or functions with a specific disease and 
they often include validated clinical questionnaires. These may be self administered such 
as the Beck's depression questionnaire or the International Prostate Questionnaire to 
determine if changes in urination are due to prostatic hypertrophy v. bladder outlet 
obstruction. These tools may be given by a health care provider who is judging features 

30 such as facial expression, inability of patient to sit down for more than 10 minutes, level of 
aoitaiion etc. while completing the Carrol Questionnaire to determine if a patient is manic. 
And finally, in the case of psychiatric illness, typically patient's who are admitted for a 
hospitalization for an acute exacerbation of their illness will be observed without realizing 
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it bv a clinician to note the>r ability to function in a vanety of settings including group 
interactions or making lunch. These "clinical endpo.nls- are highly variable per disease 
entity and subsequently the tools which are used to characterize these endpoints are also 
quite broad. 

As used herein the term "Microvolume Laser Scanning Cytometry- or "MLSC" 
means a method for detecting the presence of a component in a small volume of a sample 
using a fluorescently labeled detection molecule and subjecting the sample to optical 
scar^ine where the fluorescence emission is recorded. The MLSC system has several key 
features that distinguish it from other technologies: I ) only small amounts of blood (5-50 
Hi) are required for many assays; 2) absolute cell counts (cells/ ^1) are obtained; and, 3) the 
assay can be executed either directly on whole blood or on purified white blood cells. 
Implementation of this technology will facilitate measurement of several hundred different 
cell populations from a single harvesting of blood. The MLSC tectaology is descnbed in 
United States Patem Numbers 5,547.849 and 5,556,764 and in Dietz et al. (Cytometir 
5 -73:177-186 (1996)), and provisional patent application entitled "Laser-Scanner Confocal 
Time-Resolved Fluorescence Spectroscopy System" (United States Provisional Application 
Number 60/144,798, filed July 21, 1999), and the commonly-owned utility application 
filed concurrently with the present application, entitled "System for Microvolume Laser 
Scanning Cytometry", each of which is incorporated herein in its entirety. Laser scanning 
0 cytometry with microvolume capillaries provides a powerful method for monitoring 

fluorescently labeled cells in whole blood, processed blood, and other fluids. The present 
invention further improves MLSC technology by improving the capacity of the MLSC 
instrument to do simultaneous measurement of multiple biological markers from a small 
quantity of blood. A schematic of the improved SurroScan optical system is shown in 
25 Figure 2. 

As used herein the term "tag" means any entity or species, including but not limited 
to an atom, a molecule, a fragment of molecule or a functional group; a particle or 
combination of particles; a single or sequence of electromagnetic pulses; or any other form 
of matter associated with, attached to (either covalently or non-covalently), or otherwise 
30 comiected to a component of a biological system (a molecule or collection of molecules 

such as cell, a cation, an anion, an atom, or any supramolecular assembly, including but not 
limited to non-covalent complexes between biological molecules) that is used to, identify. 
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quantify, associate, recognize, follow, spot, make out. see, name, track, or othenvise 
distinguish (henceforth I/Q) said component. 

Taas are often extrinsic, i.e. not part of the component under investigation. For ^^ J 
example, a fluorescent dye molecule is often used as a tag, either for tracking, quantitation. 
5 or both. Likewise, the use of biotin or strepiavidin as tag, linked to a secondary species 
such as an enzyme for ELISA, is widespread. Other forms of tags include, but are not 
limited to. isotopic mass tags for protein I/Q by mass spectrometry', Raman-active tags for 
I/Q by Raman scattering, particulate tags for I/Q by light scattering, fluorescence, 
agglutination, energy transfer, and a variety of other detection mechanisms, including 

10 surface plasmon resonance. 

In this regard, there are almost an infmite number of particulate tags, only a small 
number of which have been previously used. As nanoparticle science is in its infancy (as 
was organic chemistry two centuries ago), one can anticipate that the complexity of 
particulate tags will approach molecular complexity. In other words, we expect that 
1 5 particulate tags might rival the organic molecules currently used as bead tags in 

combinatorial chemistry, in other words thousands to hundreds of thousands or even 
millions of uniquely identifiable tags. We further anticipate that such tags will become 
small enough to allow all intracellular measurements. For example, there are now roughly 
one-half dozen different luminescent semiconducting quantum dot nanoparticles, each 
20 fluorescing at a different wavelength. In theory, one could anticipate production of 
thousands or millions of such orthogonal nanoparticuiate optical tags, although the 
detection mechanism may or may not involve fluorescence (or even other optical methods). 

The same could be said of supramolecular science, and supramolecular tags. We 
anticipate that molecular assemblies held together by non-covaient forces could ultimately 
25 find use as tags. Furthermore, tags could comprise individual molecules either covalently 
or non-covalently associated with biological components. For example, one could imagine 
using electrochemically-active redox tags to uniquely identify components. If one had 10 
different molecules, each with a different redox potential, and each pre-f\inctionalized to 
react with a particular biological component, then one could carry out multiplexed tag I/Q, 
30 using the detection of the redox potential as the identifying characteristic. This is identical 
to the strategy currently used with fluorescence, with redox -space" used in lieu of 
"wavelength" space. 
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Note also that a tag can be a functional group, as in a carboxylate, an amine, a 
sugar, etc.. or even a spin associated with a molecule. For example, we anticipate the 
possibility that two samples could be mixed together, with each sample having one or more 
nuclei imparted with a particular sequence of electromagnetic pulses (of the sort typically 
used in high-field NMR). We further envision that the pulses for two samples would be 
lono-lived enough to compare them using a method of detection. In particular, we envision 
that possibility that the signatures for the t^vo samples would cancel for all species where 
the concentrations are identical, leaving behind a signal only for those species where 
concentrations in the two samples are non-identical. 
) It should be clear to a person skilled in the art that there is no functional difference 

between tags, as defined above, and "reporters" or "reporter molecules", as typically used 
in the chemical and biological literature. Likewise, a "detection molecule" as defined 
below, can itself be a tag (for example when 1/Q is based on mass, as in quartz cr>'stal 
microbalances or piezo inertial biosensors). 
5 As used herein the term "detection molecule" means any molecule or molecular 

assembly capable of binding to a molecule or other species of interest, including but not 
limited to a cell-associated molecule, a soluble factor, or a small molecule or organic 
molecule. Preferred detection molecules are antibodies. The antibodies can be monoclonal 
or polyclonal. Note, however, that as new types of detection molecules are discovered and 
0 popularized, they certainly can be used. For example, aptamers are increasingly being used 
for molecular recognition, and organic chemists have now synthesized a large number of 
molecular receptors. Ultimately, these could be used as detection molecules, either by 
themselves or in association with a lag. 

As used herein the terms "dye", "nuorophore", "fluorescent dye" are used 
15 interchangeably to mean a molecule capable of fiuorescing under excitation by a laser. The 
dye is typically directly linked to a detection molecule in the present invention, although 
indirect linkage is also encompassed herein. Many dyes are well known in the art and 
include, but are not limited to those shown in Table 2. In certain preferred embodiments, 
fluorophores are used which can be excited in the red region (> 600 nm) of the spectnim. 
30 Two red dyes, Cy5 and Cy5.5. are typically used. They have emission peaks of 665 and 
695 nanometers, respectively, and can be readily coupled to amibodies. Both can be 
excited at 633 nm with a helium-neon laser. Seis of 3 red dyes that may be used include, 
Cy5, Cy5.5 and Cy 7 or Cy5. Cy5.5 and Cy 7 .^C. See, Mujumdar eiaL, Bioconjugaie 
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Chemistry, 7:356 (1996); United States Paieni No. 5.268,486; Beavis eLlL, Cytomeio'. 
24:390 (1996); Roderser eiaL, Cytometo', M^'^l (1996); and United States Patent No. 
5,7 14,386. Additional novel dyes useftil for tagging or detection purposes within the 
present invention are descnbed in commonly-owned United States Provisional Application 
5 Number 60/1 42,477, filed July 6, 1 999, entitled "Bridged Fluorescent Dyes. Their 

Preparation and Their Use in Assays," incorporated herein in its entirety by this reference. 

As used herein the term "animal model" refers to any experimental animal system 
in which diseases or conditions with similar pathology and progression to human diseases 
or medical conditions can be developed. Suitable animal systems include, but are not 
10 limited to, rats, mice, rabbits, and primates. In some cases, the disease arises 

spontaneously in the animal model. In other cases, the induction of disease in the animal 
model can result from exposure to the same conditions--for example, infection with a 
pathogen, exposure to a toxin, or a particular diet-that causes the disease in humans. 
Alternatively, the disease or condition can be induced in the animal model with agents that 
1 5 mimic the human disease or medical condition even if the actual initiator(s) of the human 
disease or medical condition is unknown. The disease or medical condition might also be 
induced through the use of surgical techniques. Genetic manipulation of experimental 
animal model systems provides a further tool for the development of the animal models, 
either standing alone or in combination with the other methods of disease induction. 

20 

Preclinical Applications o f Phenotvping 

Currently much effort is being directed towards the identification and analysis of 
biological markers in humans. However, it would be desirable to have a method for 
identifying and analyzing biological markers in experimental animal systems. For 
25 example, biological markers of the progression of a particular human disease could be 

identified in an experimentally-induced animal model of that disease, e.g., the rat adjuvant 
model of arthritis (reviewed in Philippe, et al, American Journal of Physiology 273;R1550- 
56 (1997)). Using the identified markers, the efficacy of experimental therapeutics could 
be determined in the animal model. Therapeutics that have a highly specific effect on the 
30 expression of biological markers in animals, which markers are prognostic or diagnostic of 
the same disease in humans, can therefore be identified without conducting early-and 
hence risky-human clinical trials. Alternatively, novel biological markers can be 
identified in experimental animal models of human disease, and then experiments can be 
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performed to determine whether the same markers, or their human homologues, are 
prognostic or diagnostic of the same disease or medical condition in humans. In some 
cases, biological markers identified in humans can be used to facilitate preclinical tnals 
where animal models can be evaluated by the coiTesponding biological markers. The 
present invention provides methods and mstoi mentation for perfonning such analyses. 

In one series of embodiments of the invemion. the expression of biological markers 
is studied in an animal model of a human disease. There are currently many such models 
and many more being developed, using a variety of different techniques to induce the 
specific disease. In each case, the biological markers of interest can be initially identified 
m prefen-ed embodiments using MLSC. The identified markers can then be studied using 
MLSC to determine the response of the animal to a candidate therapeutic. Because MLSC- 
based assays typically only require small volumes of biological fluid, MLSC is uniquely 
suited for use in animal model systems (especially in rat and mouse) where only limited 
amounts of fluid can be obtained from an animal without sacrificing it. In particular, the 
15 use of MLSC will permit multiple time point analysis of an experimental animal to 
detemiine the pharmacokinetics of a candidate therapeutic. 

In some embodiments of the invention, the animal homologues of known or newly 
identified human biological markers of a particular disease are studied in an 
experimentally-induced animal model of that disease. In many cases, the animal 
20 homologues of human molecules will already be known and characterized. For example, 
through extensive study, a great deal is known about proteins that behave similarly in 
mouse and in humans. The identification of previously unknown ammal homologues of 
human biological markers, and the preparation of reagents that can bind to them, can be 
accomplished through the use of standard molecular biology tecliniques well known in the 
25 art. 

In other embodiments of the invention, novel biological markers--for example, a 
previously unknown pattern of expression of known blood cell-associated proteins-may be 
initially observed in an animal model of a human disease. In this embodiment, the 
relevancy of the identified markers to the progression or development of the human disease 
30 can be determined by identifying human homologues of the biological markers, and then 
studying their expression in humans suffering from the disease of interest. If the identified 
animal biological markers appear to be relevant to the human disease, then they can ser%'e: 
1) as the basis of new diagnostic and prognostic assays for the disease in humans; and 2) 
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as a means for evaluating the spec.r.city and efficacy of candidate therapeuncs m the 

animal model of the disease. 

In one embodiment of the present invention new and improved animal models may 
be developed based on biological markers identified in humans. For example, utilizing the 
biological marker identification system of the present invention U can be found that for a 
gwen^'disease or medical condition that the level of given soluble factor m senim is greatly 
increased, while the level of certain cell population is decreased. Based on this 
infomiation, animal models can be tailored - for example by the use of genetic knockouts 
of homologous factors - to better simulate the disease in the animal serum. 

The phenotyping system of the present invention may also be useful in the 
identification of new or improved animal models. For example, by phenotyping a number 
of genetically altered animals, a fuller picture of the manifestations of the genetic 
aUemations can be recognized. Utilization of this knowledge can be useful in identifying 
new or improved animal models. For example, it may be possible to create a number of 
genetic knock-out mice that all appear to simulate a chosen human disease state. However, 
by phenotyping each of the various knock-outs, as well as humans that suffer from the 
disease, it will be possible to identify the animal model that most closely mimics the human 
disease. 

The present invention can be used in any animal model of a human disease. By 
20 way of illustration only, the present invention can be used to identify and analyze 

bioloaical markers m ammal models of many aspects of cardiovascular disease, including 
hypertension, artherosclerosis. cardiac hypertrophy, atherogenesis. and thrombosis. Many 
animal models of congestive heart failure and hypertrophy are currently being developed, 
and a number are reviewed m: CamieUet, Artherosclerosis, 144:163-93(1999); Young et 
al.. Molecular Basis of Cardiovascular Disease, 37-85 (K.R. Chien, Editor) (1999); 
Hasenfuss, Cardiovasc. Res. 39:60-76 (1998); Krege et al, Fundam. Clin. Cardiol. 26:271- 
92 (1996); Liao et al.. Am. J. Therap. 4:149-58 (1997); and Becker et al.. Hypertension 
27:495-501 (1996) The following is apartial list of some animal models of cardiovascular 
disease: 

. The JCR:LA-cp rat model of human vascular disease can be used to identify and study 
bicmarkers that correlate with insulin resistance, vasculopathy, and cardiovascular 
disease. O'Brien et al. Can. J. Physiol. Pharmacol. 76: 72-76 (1998). 
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. Animal models of insulin-dependent diabetes have been used to study the development 
of ischemic heart disease in the diabetic population. Reviewed m: Pierce et a!.. Can. J. 
\J) Physiol. 75:343-50(1997). 

. Infection of mouse, rabbits and monkeys with Chlamydia pneumonia has been used to 
5 investigate that pathogens role in the development of asthma and cardiovascular disease 

in humans, as reviewed in: Saikku et al. Artherosclerosis, 140 (Suppl. 1). S17-S19 
(1998). 

. Spontaneously hypertensive rat (SHR) strains, and SHR strains carrying a portion of 
chromosome 13 (includmg the renin gene) from nomiotensive rats (SHR.BN-Ren) can 
,0 be used to investigate the interaction between high blood pressure and dyslipidemia in 
cardiovascular disease. St. Lezin et al. Hypertension. 31:373-377 (1998). 
. Spontaneously occurring hypertrophic cardiomyopathy in Landrace pigs may be a 
useful model of cardiovascular disease in humans. Chiu et al., Cardiovasc. Pathol. 
8:169-75 (1999). 

15 . Cardiomyopathic hamster strains can be used to investigate the role of brain and atrial 
natriuretic peptides (BNP and ANP) in human cardiovascular disease. Tamura et al., J. 
Clin. Invest. 94:1059-68 (1994). 
^ . Hypertensive and atherogenic rat strains have been used as models for the study of the 

effect of dietary salt, protein and lipids on the pathogenesis of human cardiovascular 
20 disease. Reviewed in: Yamori et al.. Nutritional Prevention of Cardiovascular Disease 
(Symposium Proceedings) (1984). Published by: Academic Press, Orlando, Fla. 
. Rat, guinea pig. rabbit, dog, sheep, and baboon models of preeclampsia have been used 
to study the pathophysiology this hypertensive disorder of human pregnancy. 
Reviewed in: Hypertension in Pregnancy 12:413-37 (1993). 
95 m other embodiments, the present invention is used to identify and analyze biological 
markers in animal models of innammatory diseases such as arthntis and multiple sclerosis. 

When used to screen candidate therapeutics, the present mvention has a number of 
significant advantages over more traditional screening methodologies. Firstly, clinical 
testing comes at a relatively late stage m the development of the therapeutic, at which point 
30 the therapeutic is known to have a highly specific effect on the expression of analogous 
animal biological markers; this minimizes the nsks to the clinical participants. Secondly, 
using experimental animal models to analyze patterns of biological marker expression 
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means thai only relatively small quantities of the potential therapeutic need be synthesized 
initially, thus reducing the cost of therapeutic development. 

In other embodiments, the methods and systems of the presem mvention are used to 
identify markers of disease or medical conditions in animals for veterinary purposes. The 
identined markers can then be used to screen for candidate therapeutics directed against 
that disease or condition. This embodiment can be applied to domesticated animals, 
livestock and plants. 



5 



Tnitnimentation 

0 Any suitable means for obtaining data that meet the requirements of the data 

categories is within the scope of the invention. In the preferred embodiments. 
Microvolume Laser Scanning Cytometry ("MLSC") is used to obtain the data for cell 
associated molecules and cell type count. In some embodiments, the MLSC technology is 
used with a bead based capture system or with various types of enzyme linked 
1 5 immunosorbent assays (such as ELIS A) to obtain data for soluble proteins. Another 

preferred means for obtaining data for compounds, particularly small molecules, includes 
the use of mass spectrometry. The MLSC technology used in this invention, is a powerful 
method for monitoring fluorescently labeled cells and soluble proteins in blood. This 
technology is currently used in clinical laboratories for the identification of one or two 
20 cellular markers for diagnostic applications. The present invention uses MLSC to facilitate 
the identification of biological parameters. In one embodiment, the present invention 
improves MLSC technology by improving the capacity of the MLSC instrument to do 
simultaneous measurement of multiple biological characteristics or parameters from a 

small quantity of blood. 

Specific enhancements achieved with the instrument of the invention (termed 
"SurroScan instrument") include the followmg: 1) two additional fluorescence color 
channels allow simultaneous detection and measurement of up to four fluorescent colors; 
2) higher laser excitation power improves sensitivity and throughput; 3) disposable 
capillary arrays allow more assays per patient sample using less blood per assay: 4) 
30 improved sof^vare and system integration automates sample measurements and data 

analysis; 5) the capacity of SurroScan instruments is expanded to handle higher volumes 
of patient samples for database creation and biological marker discoveiy. 
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Mirrnvniump I aser Sc pnninn Cvtomenv (MLSC) Svstem Design 

The MLSC technology is described in United States Patent Numbers 5,547.849 and 
5.556.764 and in Dietz et al. (Cytometry 23:177-186 (1996)), each of which is incorporated 
herein in its entirety. The Imagn 2000 system, commercially available from Biometric 
Imaging Inc.. is an example of a MLSC system. Laser scaruiing c>'tometry with 
microvolume capillaries provides a powerful method for monitoring fluorescently labeled 
cells in whole blood, processed blood, and other fluids. The present invention further 
improves MLSC technology by improving the capacity of the MLSC instrument to do 
simultaneous measurement of multiple biological markers from a small quantity of blood. 
A schematic of the improved SurroScan optical system is shown in Figure 2. The preferred 
MLSC instrument for use in the present invention is described in commonly owned United 
States Provisional Application No. 60/144,798, filed July 21. 1999, entitled "System for 
Microvolume Laser Scanning Cytometry" and in the commonly-owned utility application 
filed concurrently with the present invention entitled "System for Microvolume Laser 
15 Scanning Cytometry". Both of these applications are incorporated herein by reference m 
their entirety. 

One embodiment of the improved optical configuration is shown in Figure 2. A 
capillary array 10 contains samples for analysis. In the preferred embodiment, collimated 
excitation light is provided by one or more lasers. In particularly preferred embodiments, 
20 excitation light of 633nm is provided by a He-Ne laser 1 1 . This wavelength avoids 

problems associated with the autonuorescence of biological materials. The power of the 
laser is increased from 3 to 17 mW. Higher laser power has two potential advantages, 
increased sensitivity and increased scanning speed. The collimated laser light is deflected 
by an excitation dichroic filter 12. Upon reflection, the light is incident on a galvanometer- 
25 driven scan mirror 13. The scan minor can be rapidly oscillated over a fixed range of 
angles by the galvanometer e.g. +/- 2.5 degrees. The scanning mirror reflects the incident 
light into two relay lenses 14 and 1 5 that image the scan mirror onto the entrance pupil of 
the microscope objective 16. This optical configuration convens a specific scanned angle 
at the mirror to a specific field position at the focus of the microscope objective. The +/- 
30 degree angular sweep results in a 1 mm scan width at the objective's focus. The 

relationship benveen the scan angle and the field position is essentially linear in this 
confi<'uration and over this range of angles. Furthemiore the microscope objective focuses 
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the incoming colHmaied beam to a spot at the objective's focus plane. The spot diameter, 
which sets the optical resolution, is determined by the diameter of the coUimated beam and 

the focal length of the objective. 

Fluorescence samples placed in the path of the swept excitation beam emit stokes- 
5 shifted light. This light is collected by the objective and coHimaied. This collimated light 
emerges from the two relay lenses 14 and 15 still collimated and impinges upon the scan 
mirror which reflects and descans it. The stokes-shifted light then passes through a 
dichroic excitation filter (which reflects shorter wavelength light and allows longer 
wavelength light to pass through) and then through first long pass filter 17 that further 
1 0 serves to filler out any reflected excitation light. 

The improved instrument of the instant invention then uses a series of further 
dichroic filters to separate the stokes-shifted light into four different emission bands. A 
first fluorescence dichroic 18 divides the two bluest fluorescence colors from the two 
reddest. The two bluest colors are then focussed onto first aperture 19 via a first focusing 
1 5 lens 20 in order to significantly reduce any out-of-focus fluorescence signal. After passing 
though the aperture, a second fluorescence dichroic 21 further separates the individual blue 
colors from one another. The individual blue colors are then parsed to two separate 
. photomullipliers 22 and 2i3. The two reddest colors are focused onto a second aperture 24 
via a second long pass filter 25, a mirror 26, and a second focusing lens 27 after being 
20 divided from the two bluest colors by first fluorescence dichroic 28. After passing through 
aperture 24, the reddest colors are separated from one another by third fluorescence 
dichroic 28. The individual red colors are then parsed to photomultipliers 29 and 30. In 
this way, four separate fluorescence signals can be simultaneously transmitted from the 
sample held in the capillary to individual photomultipliers. This improvement, for the first 
25 time, allows four separate analytes to be monitored simultaneously. Each photomultiplier 
generates an electronic current in response to the incoming fluorescence photon flux. 
These individual currents are converted to separate voltages by one or more preamplifiers 
in the detection electronics. The voltages are sampled at regular intervals by an analog to 
digital converter in order to determine pixel intensity values for the scanned image. The 
- 30 four channels of the instant invention are named channel 0, 1. 2. and 3. 

The new optical layout has four detection channels to allow simultaneous 
measurement of up to 4 fluorescently labeled molecules. In a prefeired embodiment, 
multiple-color assays are used. Typically 3 or more fluorescent colors are used in each 
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assav. Under circumstances where appropriate dye combinations are available, the 
instrument is capable of supporting 4-color assays. 

An XY translational stage is used to move an array of capillaries relative to the 
optical system. The SunoScan system translation stage holds two arrays, each of which 
5 has the footprint of a 96-weU plate. Capillary arrays have been designed which have 32 
fixed capillaries each and spacing that is compatible with multi-channel pipettes. The 
operator is able to load two plates of 32 capillaries at a time. No operator intervention is 
needed while the plates are scanned and the images are processed. As an alternative. 16 
individual capillanes designed for the Imagn 2000 (VC120) are loaded into alternative 
10 holders. 

Image processing software accommodates images with either 2, 3, or 4 colors of 

fluorescent dyes. The software automatically identifies and parameterizes particles 
detected in any of the indi vidual colors. The measured paranieters describing each particle 

are saved in a list-mode format, which is made compatible with conventional cytometry 

1 5 analysis software, such as FlowJo. 

A new disposable cartridge design containing arrays of capillaries has been 
developed and is described in Provisional United States Patent applications, (United States 
Provisional Application Number 60/1 30,876, entitled. "Disposable Optical Cuvette 
Cartridge", filed April 23. 1999; United States Provisional Application Number 
20 60/130.918, entitled "Spectrophotometric Analysis System Employing a Disposable 

Optical Cuvette Cartridge", filed April 23. 1999; and United States Provisional Application 
Number 60/130,875, entitled "Vacuum Chuck for Thin Film Optical Cuvette Cartridge", 
filed April 23, 1999), and the commonly-owned utility application filed April 20, 2000. 
entitled "Disposal Optical Cuvette Cartridge", which are incorporated by reference herein 
25 in their entirety. This capillary cartridge is used in Examples 5, 7 and 8. The design 
currently in use, called Flex-32. contains 32 capillanes. Fill holes in the FLEX32-plates 
have the same 9 mm spacing as 96-well plates and multichannel pipetting devices. It is 
constructed from 2 layers of mylar sandwiched together with a double-sticky adhesive 
layer which is die-cut to define the capillary inner dimensions. The resulting cartridge can 
30 be manufactured at low cost in high volumes. The cartridge is flexible, which allows it to 
be held onto an optically flat baseplate by vacuum pressure, removing the requirements for 
flatness in the manufacturing process. The capillary spacing was designed to retain 
compatibility with multi-channel microplate pipetters and robotics. 
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rellular Assavs ^ 
The invention includes cellular assays, many of which are antibody based, that are ^ 
compatible with mstrumentation, preferably MLSC instrumentation and are capable of 
5 measuring hundreds to thousands of cell populations and their cell associated molecules 
from a single 10 mL tube of blood. In one preferred embodiment, any type of detection 
molecule and assay formal coinpatible with MLSC is encompassed in this invention, 
including, but not limited to cell surface proteins including markers of activation and 
adhesion, intracellular molecules, assays to distinguish changes in activation states of cells, 
10 assays to concentrate and identify rare white cells, assays for use with whole blood, and 
assays for detection of soluble factors, such as proteins, in blood. 

As with flow cytometry, fluorophore-labeled antibodies specific for cell surface 
antigens are used to identify, characterize and enumerate specific populations. The 
reaction can be done in whole blood. In general, there is no need to wash the reagent away; 
1 5 quantitative dilution of the blood-antibody mixture is usually sufficient sample preparation. 
The cell-antibody mixture is loaded into an optical-quality capillary of known volume and 
analyzed with a laser-based fluorescence imaging instrument. In order to operate with 
whole blood, fluorophores are used which can be excited in the red region (> 600 nm) of 
the spectrum. Purified white blood cells can also be analyzed with the instrument. In 
20 contrast to flow cytometry, the laser scans over stationary cells rather than cells flowing 

past the laser. A small cylindrical laser spot ,s scanned across the capillary in one direction 
while the capillary is translated relative to the optical system in a second direction. 
Photomultiplier tubes are used to detect the fluorescent signal. Image-processing software 
is used to analyze the image and identify and enumerate the cells of interest. 
25 This MLSC approach allows one to obtain absolute cell counts on hundreds of 

different cell populations from a single tube of blood. For a set of antibodies to 100 
different antigens, there are about 5000 possible 2-color combinations and about 162.000 
possible 3-color combinations, (n combinations r at a time, nC, = n!/r!(n-r)!) so careful 
thought ,s needed to develop the most appropnate set of 100 or so assays. Multi-color 
30 capability allows more cell populations to be identified with a given amount of blood than 
the original 2-color system. As an example, by multiplexing reagents all populations 
identified in two 2-color assays can be identified m one 3-color assay. For example it is 
possible to assay CD3, CD4 and CDS in one capillary instead of CD3 and CD4 in one 
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capillary and CDS and CDS in another. More imponantly, unique celi populations can be 
defined by the simultaneous expression of three or more antigens. For example, CDS T 
cells can be subsetted into 4 different populations based on the differential expression of 
CD45RA and CD62L. 



Immunoassay Procedures 

Immunoassays can be nin in a variety of formats and any appropriate format is 
envisioned in the present invention. Two examples are given below. The MLSC system 
can be used with microsphere-based immunoassays. In this sandwich assay, the 
microsphere is used as a solid support for an analyte-specific capture antibody. Analyte 
from a biological fluid is bound to the antibody-coated microsphere and detected with a 
second antibody, which is directly labeled with a fluorescent molecule such as Cy5, and 
which binds to a distinct epitope on the analyte. A protocol using amino beads and a 
heterobifunctional crosslinker to covalently attach antibodies via their hinge region works 
well in multiple assays. It is possible to distinguish beads of different sizes (3 to 20 micron 
range) with the MLSC instrument and current software. By coupling different capture 
antibodies to microspheres of different sizes it is possible to multiplex immunoassays in a 
single capillary. Internal indicator dyes can also be used to distinguish microspheres and 

facilitate multiplexing. 

The immunoassays for soluble factors discussed in Example 5 are all 
chemiluminescent - based sandwich ELISA. Microliter plates are coated with capture 
antibodies specific for the analyte of interest and blocked. Biological fluid containing the 
analyte is added, incubated and then washed. Biotinylated antibody specific for a second 
epitope on the same analyte is added, incubated and washed followed by an avidin- 
alkaline phosphatase conjugate. The level of analyte is revealed with a chemiluminescent 
alkaline phosphatase substrate. Plates are read in a Wallac Victor! lummometer or similar 



instrument. 



ne^ion and imole^^nt.tinn for a r o hM-;t panel of cellular assavs to aid the discoveiA^ of 
biological markers for di<;eases or m edical conditions 

The MLSC system is designed to allow rapid staining of cells using minimal 
quantities of blood. Reagents directed against scores of different cell surface antigens are 
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developed, which when combined can identify hundreds of different cell populations. The 
strategy for reagent and combination development is discussed below. 

' A set of monoclonal antibody reagents are employed which are suitable for 
developmg more than 1 00 cellular assays. To date, many (about 1 20) different monoclonal 
antibodies directed against numerous (about SO) different cell surface antigens have been 
successfully.identif.ed and tested with the 2-color MLSC mstmment. The small organic 
dyes like Cy5 and Cy5.5 are readily coupled to the amino groups of antibodies using 
single-step NHS chemistry and well established procedures. Preferred dye-to-antibody 
ratios, have been determined for Cy5. Cy5.5, and Cy7 reagents, and are generally m the 
range of one to four. Protein fluorochromes, like AFC, are linked to the sulfhydryl groups 
of moderately reduced antibody in a 3-step procedure using the heterobifuntional 
crossUnJcmg reagent SMCC. Preparation of reagents containing other Ouorophores is also 
possible. The preparation of Cy7-APC and (Cy7-APC)- antibody conjugates for flow 
cytometry applications has been previously described. The antibody-fluorophore coupling 
chemistry is the same as for APC. All protein-protein conjugates are purified by traditional 
means, such as, by gel filtration on an Akta FPLC. Fluorescent microspheres can also be 
investigated. Antibodies are coupled with 2-step carbodiamide chemistry to carboxylated 
microspheres. 

New monoclonal antibodies reagents are titrated on both whole blood and lysed red 
blood cells. Reagent specificity, and lack of non-specific binding, is confirmed with 
appropriate counter stains. Analysis is done with any appropriate software program, 
including FlowJo cytometry software (Treestar, Inc available as an Internet download at 
t,,.^./..„.„„.r...t.rron./nowio/). From the titration the optimal amount of each reagent 
per assay (typically 0.01 to 2 ng/ml) and preliminary analysis cnteria is determined. In the 
preferred embodiment, all assays are conducted in homogenous (no wash) mode. This 
generally requires that each antibody reagent have a titer point of <1 ^g/ml so that the 
nuorescence background is not too high. A potential difficulty may be that a particular 
reagent may not be amenable to conjugation or may have too high of a titer point. It is 
usually possible to substitute a second monoclonal antibody to the same antigen. There is 
) also a nsk that some individual antigens may not be measurable during the time course of 
the study. Multiple antibodies from each antigen category are ts-pically evaluated. 
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Typically, a panel of about 50-100 (or greater) cellular assays is developed for 
monitoring a disease or medical condition. Such assays enable one to enumerate hundreds 
3 of different cell popiilations. As an example, it is possible to monitor the immune and 

innammaiory cellular parameters potentially significant for rheumatoid arthritis (RA). For 
5 RA populations, the cell surface antigens being evaluated for use may be divided into 
different subsets based on the types of cellular antigens recognized. As an illustration, 
antigens found on the major leukocyte subtypes including T cells, B cells, antigen- 
presenting cells, NK cells, and granulocytes, as well as relevant receptors and structures 
found on these cells are included. These may include activation molecules, co-stimulatory 
1 0 molecules, adhesion molecules, antigen receptors, cytokine receptors, etc. A 

representative, but not exhaustive, list of the amigens that may be evaluated for RA is 
provided in Table I . 



15 



Pellular Assav Fomiats 

The cellular assays described above are designed in either of two formats, whole 
blood or RBC-lysed blood. In the preferred embodiments, the assays are done in whole- 
blood or RBC-lysed blood format. The minimal manipulation ensures that the most 
accurate absolute cell counts (cells/Ml of blood) are obtained. Furthermore only small 
amounts of blood are required per assay so that many assays can be run from a single tube 
20 of blood. However, for some cell populations an alternative assay format. RBC-lysed 

blood, will be preferable. These include particular antigen-antibody pairs for which soluble 
factors (free Ig. soluble cytokine receptors, etc.) contained in the sera interfere with cell 
labeling and populations of cells that are present in ve>7 low frequency. This procedure is 
useful for activated cells expressing CD25 or CD69 which are essemially undetectable in 
whole blood from normal individuals but are increased ten-fold in the lysed fomiat and 
have been shown to be increased in various autoimmune states. Improved detection of 
other minor cell populations such as NK cells has also been demonstrated and should prove 
paniculady useful in analyses. As an example, for a panel of 96 assays, it is estimated that 
■64 will be done on whole blood and 32 on lysed blood. Alternative sample processing may 
30 include, preparation of PBMC by Ficoll gradient, ex vivo stimulation with polyclonal or 
antigen specific activators. 
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Combining antibody reagents is important for the identification of novel cell 
populations that may contribute to the pathogenesis, or be a marker for. diseases or medical 
conditions, such as autoimmune diseases. For example, it is known that adhesion 
molecules can be differentially expressed on T cells thought to be involved in the 

5 autoimmune process. Furthemiore, several studies have indicated that there may be an 
increase in the number of memoo' CD4 T cells in patients with autoimmune disease. With 
the assays of the present invention it is possible to simultaneously look at differential levels 
of adhesion molecules (e.g., CDl la^) specifically on a subset of memory (i.e. CD45RO*) T 
cells of the HLA class Il-restricted lineage (i.e. CD4*). This should increase the ability to 

10 identify relevant disease-related cell populations. Multiple-color capability also allows one 
to look for novel populations of cells by choosing combinations of antigens not typically 
found together on a given cell type or markers found on the same cell type at different 
stages of ontogeny. 

15 neterminatinn nf Appro priate Fluorescent Dyes 

As indicated above any appropriate fluorescent dye is within the scope of the 
present invention. Two commonly used dyes are cyanine dyes Cy5 (em 667) and Cy5.5 
(em 703). Typically, a single dichroic filter to split the emission signal at 685 nm is used. 
More filters will be required when more than two dyes are employed. Dyes are evaluated 
20 to determine their compatibility in the MLSC system. As an example, a variety of dyes 

were evaluated to determine an appropriate overall 3-color set (see table 2). Parameters to 
consider when evaluating dyes include I) spectral separation of the 3 dyes, 2) signal-to- 
noise ratio as a function of laser power, 3) suitability of the available filters. 4) ease of 
conjugation, and 5) specificity of the resulting antibody-fluorophore conjugates. Cy5 and 
25 APC are appropnate for the first color and Cy5.5 is appropriate for the second color. 
Several potential dyes are appropriate for the third color. Cy7-APC is expected to be 
suitable for the MLSC system. Preliminary results with the Imagn 2000 system 
demonstrate that this dye is detectable in the long wavelength chamiel (>685 nm) and 
distinct from both Cy5 and APC. Emission spectra indicate that overlap with Cy5.5 should 
30 not be a problem given appropriate filters for the new instrument. Fluorescent 

microspheres offer a wide variety of alternative colors and have been used successfully in 
some cytometo' applications. Conjugation methods will be used which minimize the non- 
specific binding that occasionally occurs with microsphere reagents. Typically, each of the 
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fluorophores are evaluated in the context of fluorophore-aat.body conjugates using a few 
select antibodies e.g. ant,-CD3, anti-CD4 and anti-CD20. 

<;nUihle Factor .Approaclies 

1 There are a large number of bioanalyses carried out by means other than 

fluorescence. Pronnnen. among these is mass spectrometry., rapidly becoming the tool of 
choice for detailed identification and analysis of polypeptides and proteins. There are two 
widely-used methods for biomolecular sample introduction in mass spectrometry: 
electrospray ionization(ESI) and matrix-assisted laser desorption/ionization (MALDI). 

0 MALDI-TOF is currently successfully utilized for the analysis of proteins, polypeptides 
and other macromolecules. Even though the introduction of an organic matrix to transfer 
energy to the analyte has advanced tremendously the field of desorption mass 
speclrometo', MALDI-TOF still has some limits. For instance, the detection of small 
molecules is not practical because of the presence of background ions from the matrix. In 

1 5 such cases ESI or even gas chromatography (GC) mass spectrometry can be used to detect 
or profile. 

The complexity of molecular structures and heterogeneous nature of proteins 
necessitates the need for multidimensional separation techniques. One of the major areas 
for this is the development of two dimensional gel electrophoresis using polyacrylamide as 
20 the gel matnx for example. The gel is modified m terms of crosslinking, addition of 

detergents, immobilization of enzymes or antibodies (affinity electrophoresis) or substrates 
(zymography) and pH gradient. This technique is used for the characterization of proteins 
in terms of structural modifications, activities, pi values, and molecular weights. 

Another area is the development of multidimensional chromatographic approaches 
25 also referred to as hyphenated separation techniques. The advantages include the ability to 
more accurately quantify the analyte and better compatibility with online detection 
methods like laser induced fluorescence or mass spectrometry. To date usually two 
separation systems are chosen such that they are orthogonal and lead to a better peak 
capacity (resolution). Major technical hurdles are the imegration of the various separation 
techniques with the detection system m terms of maintaining resolution upon transfer to the 
second dimension and the compatibility of the mobile phases with the detection system, for 
examples salts and detergents in the eluant are incompatible with electrospray mass 



30 



ENSOOCID: <WO._ .0065*72A1 I > 



wo 00/65472 PCT/USOO/1 1296 



spectrometry, Naylor .1. Chromatogr. A 1996, 744 237-7S ; Jorgenson .Mial. Chem 1997. 
69, 151S-1524. 

Chemical denvatization can be selectively employed to activate componems in a 
mixture that are not ionized enough to yield an ESI mass spectra. For example sterols 
5 typically devoid of acidic or basic residues that do not ionize under electrospray conditions 
have been coupled with ferrocene carboxylic acids, the electrochemical nature facilitating 
ionization. ,^nal. Chem. 1994, 66, 209-212. Derivatization can also ser^'e as an handle to 
differentiate between stereoisomers (isobaric species) by using different fragmentation 
patterns in their daughter and granddaughter ions of the parents. The MSn capability in an 
10 quadmpole ion trap mass spectrometer for example has been used to distinguish 

hexosamine monosaccharides, glucosamine, galactosamine and mannosamine derivatized 
with CoC12(DAP)2Cl where DAP is diaminopropane. Anal. Chem 1999, 71, 4142-4147. 

Affinity based separation followed by mass speclrometric detection is of clinical 
interest as it allows analysis of complex molecules in biological fluids like blood and urine 
1 5 with little or no sample preparation. Ciphergen's technology (surface enriched laser 
desorption ionization (SELDI), a variation of MALDI) is based on this principle. 
Ciphergen offers 5-6 different surfaces upon which protein and/or small molecules are 
applied, and then washed with increasing stringency. Since each surface/stringency 
combination leads to a different adsorption profile, the technique provides means for 
20 analysis of a complex mixture. 

Clinical Data and Informatics 

The identification and correlation of biological markers with clinical measurements 
requires the integration of vast amounts of biological and medical data and a search engine 
25 that makes such data accessible and usable. The instnimentation and assays developed in 
the present invention have the ability to identify hundreds to thousands of independent 
markers from a small sample of blood. The present invention includes developing a broad 
clinical strategy to collect extensive medical infonnation from patients that are followed 
over the time of disease progression and response to therapy. In addition, the present 
30 invention includes software, databases and data mining tools to correlate patterns of 

markers with specific diseases, disease progression and responses to therapy, including, but 
not limited to, databases of assays and clinical information, data conversion and statistical 
analysis tools, and medical questionnaire prototypes. The information system of the 
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present invention is designed to use common language and common formats for entry of 
disparate types of data and is structured for data-mimng purposes. 

The universal medial language which will likely become widely used in the next 
several years is SnoMed-RT. This language w.U be readily adaptable with the current 
infomiation system of the present invention. Similarly, the present invention is adaptable in 
that as other languages or technologies become available, they may also become 
incorporated into the database. An example would be the eventual development of tools to 
integrate digital x-rays, mammograms, or a virtual colonoscopy which is obtained via a 

C.T. scan of the abdomen. 

The teclinical challenges in developing an infomiatics system capable of handling 
the vast amounts of biological and clinical infomiaiion necessary to correlate biological 
markers with disease include modeling and integrating a number of diverse, complex, and 
often incompatible information sources, adapting to rapid advances in scientific and 
medical knowledge and methods, and developing a user-friendly interface, proper format 
and powerful search tools. The informatics system provided by the present invention meets 
these technical challenges. 

Data Analysis 

The data output from the cellular analyses includes both numbers of cells per ]i\ of 
) whole blood for each population identified, the mean intensity of staining for each cell 

associated molecule, which gives an estimate of the antigen density for a given population, 
the mean size of cells, and the expression levels of a particular molecule. Each number 
will be analyzed, because, as explained above, both the actual cell numbers as well as 
expression levels of a particular molecule may vary in a given disease stale. To identify 
5 markers (cell counts or staining intensity, levels of soluble factors) associated with 
categorical clinical variables (such as diagnosis of disease) a vanety of discriminant 
techniques are used. To identify markers associated with continuous-valued clinical 
vanables (such as levels of soluble factors) a variety of regression techniques are used. For 
both discriminant and regression analyses, stepwise variable selection and cross-validation 
30 are used to identify those markers that are most closely associated with the clinical vanable 
of interest. Where appropriate, demographic and clinical vanables (such as age, gender, 
concomitam drugs, etc.) and genetic parameters are included as covariates in the models. 
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These lecliiiiques are applied in the analyses of data, typically, using the SAS and Siaiistica 

statistical analysis software packages. 

The architecture of the integrated informatics infrastnicture (see Figure 3) of the 
present invention, comprises a mulli-tiered stnicturc. The lowest level consists of a set of 
data sources. The first source composes the scientific data which includes, but is not 
limited to, cellular assay data and soluble factor assay data. The second source may be 
semi structured data which is in a combined form of textual and tabular data describing 
protocols for assay development and protocols for the execution of clinical studies. The 
structure may be encoded as a data type definition (DTD), defining tags that sen'e both for 
information indexing and querying as well as selective information display on web 
browsers. The DTD tags also define an information exchange model enabling the high- 
level electronic sharing of the information with other parties. The third data source is the 
clinical data gathered and restructured to meet the clinical study requirements. Clinical 
questionnaires that are optimized to maximize, under time constraints, the collection of 
useful and quantifiable information from patients, are used to gather information and to 
provide the necessary quality control. If necessary, the questionnaires will be multi lingual 
and adapted to physical challenges (e.g., the inability to use a computer keyboard) that the 
respondents may have to face. The technology of choice for this data source may be XML. 
In addition, the clinical information gathering system also comprises of non-textual means 
of input. A respondent may interact via visual and graphical displays to provide health 
related information by pointing at images of the human anatomy so as to indicate a 
problem without having to articulate it. Other means, e.g. a simple measure of vital 
capacity and FeVlsec. in astlmiatic or emphysematous patients or monitoring devices to 
detect and/or correct cardiac arrhythmias, etc. could also be used for input. The fourth 
source of data is the instr^jmentation data containing all of the relevant parameter settings 
required for the execution of the scientific assays on a combination of different 
instruments, such as Imagn, SurroScan and the ELISA plate reader. 

Additionally, data can be collected and recorded in lists. In list form, measurement 
values for each individual cell are recorded. This facilitates identification and analysis of 
individual cell populations that express a complex set of different molecules. Alternative 
analysis schemes are readily explored, facilitating optimal data analysis. Likewise, the 
complete set of patient data (cell populations, soluble factors, medical history, clinical 
parameters, etc. ) can be stored in lists for each patient sample. 
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As indicated in Figure 3. these data sources are integrated and warehoused using a 
con^mon schema. This schema coordinates the interpretation of the information from the 
constituent data sources. The inteipretation is in a manner that is independent of the logical 
or physical stordge detail of each of the constituent data sources. The common schema 
provides that data sources can be added or modified over time (management of change) 
without significantly affectmg the tool set or user interface that ultimately use the compiled 
data. The common schema provides a buffer betxveen the ever changing data sources and 
the application programs which use the compiled data and derive knowledge from the data. 
Similarly, if in the future additional instrumentation, e.g., NMR, is included for the 
generation of additional data, it can be added without upsetting the organization of the 

already existing warehouse. 

The schema is augmented with an ontology of common concepts and their 
relationships in immunology and related clinical areas. The ontology will be used by the 
data mining tools and by the user interface to assist in the interpretation of user specified 
1 5 requests for information from the underlying data sources and for the specification of data 
mining tasks. The ontology will also be utilized in the verification of the collected clinical 
data. 

The toolkit of programs includes programs for statistical analysis, for data mining 
and for the visualization of the results. A result of the analysis by the toolkit programs 
provides a set of rules relating a set of conditions to a set of consequences. These rules are 
applied over a statistically significant portion of the underlying data and are of the foirn: 
if condl and cond2 and ... and condN then consequence I, consequence2. ... 
For example, when applied to the cellular data source and the clinical data source, 
the toolkit can derive relationships between cellular assay and soluble factor measurements 
that were previously unknown. The results of the analysis by the toolkit are recycled to the 
users and to the database reuse in the future. The architecture is intended to improve the 
knowledge discovery process by storing the accumulated discos ery experience and by 
integrating this experience for continued improvement. 

Other tools for data mining include methods for clustering in highly dimensional 
30 data. These tools are intended to augment and replace the existing method of manual gating 
as presently used. Unlike current cytometry software, which considers one assay from one 
patient at a time, the cytometry tools of the present invention may examine list mode data 
across an assay from multiple patient samples in order to determine the optimal set of 
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circumscribed population (gates). The system is coupled Nvith a multi dimensional 
visualization system that will simultaneously project the computed clusters on selected 
subsets of two-dimensional and tliree-dimensional views. 

The fmal tier is a user interface. This part of the system ser\'es the user interaction 
5 and is used to plan and execute tasks related to clinical studies. Tasks supported at the user 
interface level include, knowledge discovery from study data, clinical study planning, 
protocol planning and evaluation and assay development. 

The user interface will accept requests for information in a uniform way. It may 
combine a graphical interface and may allow for -drilling down" of information from the 
10 abstract concept level to the stored detail. It may allow for information requests that 

include both data and text (e.g., documentation pertaining to assay protocol planning) and 
may allow for interaction over a network. 

F.XAMPLES 
15 Example One 

T rc^ nf thP present invention to identi f y hinlogical markers for Rlieumatoid Arthritis 

The present invention can be used to identify biological markers for rheumatoid 
arthritis (RA). Microvolume laser scanning cytometry (MLSC) is used to help create data 
for identifying biological markers for RA. Marker discovery efforts are focused on readily 
20 accessible biological fluids, most notably blood. A two-color instrument and antibody- 
based assays have demonstrated the potential of this technique for identifying and 
enumerating scores of differem cell populations with only a small amount of whole blood. 
Multiparameter cell analysis, in combination with multiple assays for soluble factors, small 
molecules and an extensive clinical database, is a powerful tool for future biological 
25 marker discovery. Such markers have the potential to lead to new and more effective ways 
to predict and monitor disease activity and responses to therapy. 

Rheumatoid Arthritis is a chronic inflammatory disorder of the small joints, which 
also has pronounced systemic consequences. Although the etiology of the disease is 
. unknown, its pathology evolves with common characteristics over time. Early events 
30 appear to include an inflammatory response initiated by unknown mediators. Activated 
CD4' T cells appear to amplify and perpetuate the inflammation. The presence of 
activated T cells can induce polyclonal B-cell activation and production of Rheumatoid 
Factor (RF). Tissue damage accrues, releasing autoantigens, and the extent of the T cell 
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response broadens. Eventually, the constant innammatory environment may lead to 
transformation of the synovial fibroblasts, yielding destructive potential that is independent 
of T cells and macrophages. The pro-inflammatory cytokines, produced mainly by 
macrophages in the joint and the oiok.nes they induce such as IL-6. are systemically 
active, present in the serum and augment hepatic synthesis of acute-phase proteins. 
Throughout the various stages of the disease, there are changes in the molecules and cells 
in the synovium and blood that have potential to be markers of disease. Blood, because of 
its ready accessibility and circulation tliroughout the body, provides an attractive window 
for monitoring disease activity and is thus the major target of this invention. 

The present invention is useful to identify biological markers of diagnostic and 
prognostic value for Rheumatoid Arthritis. Such markers are required for classifying 
different forms of the disease, for example identifying the subset of patients in whom joint 
erosion occurs more rapidly than in others. Furtheimore. the markers are cntical for 
evaluating the efficacy of intervention and developing early, non-toxic and successful 
5 therapies. Many investigations have been made of cells and soluble factors in blood, 
synovium and urine that are candidate markers for the disease. In general, one to several 
markers at a time have been investigated. While some factors, such as rheumatoid factor 
and C-reactive protein have been associated with RA. there is no consensus panel of RA- 
specific markers. There is a strong need to simultaneously evaluate multiple candidate 
0 markers. This is achieved with multiple assays and by increasing the number of parameters 
(colors) that can be measured in a single assay. 

The present invention is capable of developing a platfomi for identifying markers of 
disease and applymg it to RA. In general, each assay combination consists of one or more 
reagents to identify the major cellular subsets (left column of Table 1). Some of these 
25 antigens, e.g. CD4, are targeted in multiple assays. The major markers are combined with 
differem subsetting antibodies (right column of Table 1) in order to maximize information 
about the sample. Properties of the fluorochomes and the target antigens are considered in 
developing each assay combination. For example, brighter Huorochromes are used with 
less abundant antigens. For other assays it is important to use reagents with the best 
30 spectral differences for certain targets. In general for each antibody tnplet FlowJo software 
is used to analyze 1 to 3 differem 3-color combinations (e.g.. Cy5 CD3, Cy5.5 CD4. 
Cy7APC-CD45RA vs. Cy5 CD45RA, Cy5.5 CD4. Cy7APC-CD3) to determine the best 
combination for distinguishing the different ceil populations. 
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Designing a successful panel of assays requires some empirical knowledge. The 
process is typicaUy an iterative one. with each experiment building upon the previous one. 
As an example, an over^•iew of candidate combinations with potential value for R.^ is 
given below. 

T cells . The major antigens being evaluated in a T cell panel include CD2. CD3, 
CD4, CDS. CD7, and CDS. Many kinds of molecules on these T cell subpopulations can 
be investigated. These include surface antigens which help to distinguish naYve (CD45RA) 
vs. memory cells (CD45RO, CD26), and antigens that play a role in activation (CD25, 
CD69,CD71, HLA class II) or co-stimulation (CD27. CD2S). In addition, markers that 
may play a role in adhesion to innammatory sites are assayed (CD62L, CDUa/CDlS, 
CD44, CD54, and CD58). Subpopulations of T cells based on expression of apTCR, 
ySTCR, and a panel of vp TCR genes are evaluated. 

B cells . The major antigens being evaluated in a B cell panel include CD 19, CD20. 
CD21, CD22, CD23. and CD72. In addition, various markers on these B cell subsets 
including markers that may indicate a more activated phenotype (CD40, CD80, CD86, 
HLA class II, CDS) and those that have been implicated in lymphocyte homing and 
adhesion (CD62L. CD44, CDlla/CD18) are analyzed. IgM. IgG, and IgA receptors for 

specific antigens are also evaluated. 

AniioPn- presentine cells . Antigen-presenting cells are evaluated using markers to 
20 the major antigens CD13, CD14. CD15. and CD33. In addition, a variety of adhesion 
molecules (CDl la, CDI8. CD29, CD44. CD54, CD58, CD62L) and co-stimulatory 
molecules (CD80, CD86) on these cells are analyzed. Other relevant receptors including 
CD16 (FcyRIII), CD32 (FcyRII), and CD64 (FcyRJ) arc assayed. 

n.h.r..lltv pP.and antigens . Only a few studies have investigated the expression 
^S of NK markers and granulocyte markers in R.\ and in general these have given 

inconsistent results. NK subpopulations using the markers CD16, CDS6, CDS7, and NKB 1 
are analyzed. Granulocsaes, including neutrophils and eosinophils, may be phenotyped 
usino CD 13, CD IS. and CD16. A panel of adhesion molecules and receptors similar to 
that described above is used to further subset these populations. 
30 There are many antigens whose expression has been associated with a more 

activated or memory phenotype, implicated in adhesion or co-stimulat.on, or shown to be 
the receptor for an imponant l.gand. Examples are outlined in Table 1 . Several of these 
markers have been examined in several autoimmune conditions and the expression has 
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been found to be variable. For example. T cells from R.^ patients show higher levels of the 
adhesion receptor LFA-1 (CDl 1 a/CDlS) but no change ,n the e.xpress.on of the IL2 
receptor (CD25). which is normally increased on act.vated cells, or a marker for act.vat.on 

and co-stimulation (CDSO). 

Some examples illustrating the kinds of indicators and cell populations that may be 

examined are discussed below. 

J cells. There are several lines of evidence that implicate T cells in RA (Fox. D.A. 
(1997) Arthritis Rheum 40, 598-609). Such evidence includes the association of RA with 
MHC class II alleles that share a common sequence m the third hypervariable region 
(Weyand. CM. and Goronzy. J.J. (1997) Axtn N Y Acad Set 815, 353-6 and Weyand. CM. 
and Goronzy, J.J- (1997) Med Clin North A:.. 81, 29-55). Since CD4^ T cells recognize 
antigen bound to MHC class II antigens, the association of RA with expression of specific 
class II molecules implies a role for CD4 T cells in RA. In addition, studiesin animal 
models of RA, such as collagen induced arthntis or adjuvant arthritis, have shown that T 
cells transferred from affected animals can induce synovitis in susceptible hosts. 
Furthermore, studies in RA patients have shown that strategies aimed at eliminating T cells 
or interfering with T cell function can ameliorate rheumatoid innammation. 

Perhaps more relevant to the present invemion, examination of the phenotype of T 
cells either in the synovial fluid, synovial tissue and/or peripheral blood of RA patients, 
have led to some interesting findings (Cush. J.J. and Lipsky. P.E. (1991) Clin Orthop , 9- 
79) Increased numbers of activated T cells are detectable in the peripheral blood and 
synovial fluid of RA patients. These T cells express CD3 and CD4 cell surface markers at 
a lower antigen density compared to controls, similar to the levels seen in mitogen 
activated T cells in vitro (Luyten, F., Suykens. S.. Veys, E.M,. Van Lerbeirghe. J., 
Ackemian. C, Mielants, H. and Verbruggen, G. (1986) J Rheumatol 13. 864-9). There is 
also a slightly decreased number of CD8^ cells in most active RA causing an increase in 
the CD4/CD8 ratio. In addition T cells from patients with RA express increased amounts 
of the early activation marker CD69 (Pitzalis, C, K.ngsley, G., Lanchbury, J.S., Murphy. J, 
and Panayi, G.S. (1987) J Rheumatol 14, 662-6), increased numbers of CD4XD29^ and 
I CD4XD45RO* memorv cells, and increased expression of MHC class II products 

(Pitzalis, C, Kingsley, G.. Murphy, J. and Panayi, G. (1987) Clin Immunol Immunopathol 
45 752-8). Expression of CD44-dependent pnmar>' adhesion strongly conelates with 
concurrent symptomatic disease in juvenile R.^ and systemic lupus erythematosus (Estess, 
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P., DeGrendele. H.C., PascuaK V. and Siegelman. M.H. (19981 J Clm Invest 102, 11 73-82) 
and may be important in adult R.\. Some studies have shown increased numbers of yb 
ICR T cells and increased HLA expression on these cells (Reme, T., Portier, M.. 
Frayssinoux. F.. Combe, B.. M.ossec, P.. Favier. F. and Sany. J. (1990) Arthritis Rheum 
5 33. 4S5-92). An increase in CDS'S?" cells in RA. sometimes associated with restricted 
TCR has also been reported (Morley, J.K., Batliwalla, P.M., Hingorani, R. and Gregersen. 
P.K. (1995) J Immunol 154, 6182-90 and Serrano, D.. Monteiro. J., Allen. S.L., Kolitz, J.. 
Schulman, P., Lichtman, S.M.. Buchbinder, A., Vinciguenra. V.P.. Chiorazzi, N. and 
Gregersen. P.K. (1997) J Immunol 158. 1482-9). Three-color assays can monitor restricted 
10 vp expression on this specific T cell subset. 

B cells. Phenotypic analysis of B cells has also been performed in RA patients. A B 
cell subpopulation expressing the pan T cell marker CDS has been shown to be elevated 
(Sowden, J.A., Roberts-Thomson, P.J. and Zola. H. (1987) Rheumatol Int 7, 2S5-9, Hardy, 
R.R, Hayakawa, K., Shimizu, M., Yamasaki. K. and Kishimoto, T. (1987) Science 236, 
15 81-3 and Casali. P.. Burastero, S.E.. Nakamura. M., Inghirami, G. and Notkins, A.L. (1987) 
Science 236, 77-81). This subset is also elevated in autoimmune mice where IgM 
autoantibodies have been shown to be constitutively expressed (Hayakawa, K. and Hardy, 
RR. (1988) Annu Rev Immunol 6, 197-218). In humans however CD5* B cells do not 
preferentially produce autoantibodies (Suzuki, N., Sakane. T. and Engleman, E.G. (1990) J 
Clin Invest 85, 238-47) and the role of CD5^ B cells in the pathogenesis of autoimmunity 
in humans is still unclear, perhaps reflecting the presence of activated B cells (Wemer- 
Favre, C, Vischer, T.L., Wohlwend, D. and Zubler, R.H. (1989) Eur J Immunol 19, 1209- 
13). Circulating B cells from RA patients also demonstrate increased expression of HLA 
DR molecules, again indicative of an activated B cell phenotype (Eliaou, J.F., Andary, M.. 
25 Favier, F., Carayon. P., Poncelet, P., Sany, J., Brochier, J. and Clot, J. (1988) 

Autoimmunity 1, 217-22). Three-color assay are able to monitor increased HLA class II 
expression specifically on CDS" CD 19 B cells. 

Antigpn- pre^^enting cells . Several cell types can sei^'e as antigen-presenting cells, 
including monocytes, macrophage, dendntic cells, B cells and other cells induced to 
30 express class II antigens. In general these cells show an activated phenotype demonstrated 
by increased expression levels of HLA class II antigens m patients with autoimmune 
disease (Lipsky, P.E., Davis, L.S.. Cush, J.J. and Oppenheimer- Marks. N. (1989) Sponger 
Semin Immunopathol 11, 123-62). Antigen-presenting cells are abundant in the synovial 
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companmem (V.ner, N.J. (1995) Br Med Bull'51. 359-67) and blood-denved macrophages 
have been associated with human cartilage glycoprotein 39 expression in some studies 
D (Karlcpatnck. R.B.. Emer,-, J.G.. Connor. J.R- Dodds. R.. Lysko, P.G. and Rosenberg. M. 

(1997) Exp Cell Res 237. 46-54). 
S c;„,..hl. facor assavs. Soluble factor assays provide an additional battery of 

potential biological markers. There are many m^portant soluble factors that have been 
identified m RA patients. These include levels of c.rculatmg cytokines such as TNFa and 
IL-6. cytokine receptors, chemokines. rheumatoid factors of different isotypes, 
immunoglobuHn with different fomis of glycosylation, hom.ones. acute-phase proteins 
,0 such as C-reactive protein and serum amyloid A. and soluble adhesion molecules, as well 
as matrix metalloproieinases and their inhibitors. Many of these soluble factors are known 
to be present at varying levels in RA patients at different stages of disease (Choy. E.H. and 
Scott, D.L. (1995) Drugs 50. 15-25, Feldmann. M., Brennan, P.M. and Maim. R.N, (1996) 
Annu' Rev Immunol 14, 397-440. and Wollhe.m. F.A. (1996) Apmis 104, 81-93). 
15 Therefore, assays can be conducted to measure these soluble factors and look for stanst.cal 
correlations with the cell populations identified. 

M.Hir..l histories. In addition to soluble factors, information in the medical histo.7 
of patients are included in the database. The clinical parameters will include information 
on age gender, stage of disease, outside laboratory tests such as ESR, previous therapy and 
20 any concomitant dmgs or therapies. This information is relevant to the evaluation. For 
example, it is known that immunosuppressive drugs, such as those often taken by RA 
patients, can have a profound effect on the expression of cell surface antigens. Patients 
treated with methotrexate show a decrease m CDIQ" and CD5M9^ B cells. Patients treated 
with cyclophosphamide show a decrease in activated T cells expressing CD25 or HLA DR. 
95 Patients treated with prednisone express several changes in cell surface phenotype. 

including a decrease in activated CD3^25^ T cells, a decrease in CD5' 1 9' B cells, and a 
decrease in CD16* and CD56* NK cells (Lacki. J.K. and Mackiewicz, S.H. (1997) Pol 
Arch Med Wewn 97, 134-43). Other clinical vanables such as disease duration may also be 

useful. 

30 TV:,,^inop.,ien,nonulations. A review of the cellular assay literature as it 

relates to autoimmune disease reveals that there are apparently conflicting reports. For 
example, some reports indicate an increase in levels of CD5 B cells (Markeljevic, .1., 
Batinic. D.. Uzarevic, B., Bozikov, J.. Cikes. N.. Bab.c-Naglic. D., Hor^-at, Z. and Mar^sic. 
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M. (1994) J Rheumatol 21 , 2225-30) in RA patients, while other studies do not (Liu, ST., 
Wano C.R.. Liu, M.F., Li. J.S., Lei. H.Y. and Chuang, C.Y, (1996) Clin Rheumatol 15, 
250-3). These publications suggest thai there may be other confounding factors that have ; 
important implications for cellular phenotypes, and perhaps cellular function, in RA 
patients. Segregating the patient populations selected for study, based on levels of soluble 
factors circulating in the serum, stage of disease, and therapy could in part explain the 
apparent discrepancy with respect to the CDS' 19* B cell levels in RA patients discussed 
above. For example, it is known that there is a significant correlation bet%veen the levels of 
IgM rheumatoid factor and the percentage of CDS B cells (Youinou, P., Mackenzie, L.. 
Katsikis, P., Merdrighac, G., Isenberg. D.A., Tuaillon, N., Lamour, A, Le Goff, P.. 
Jouquan, J., Drogou, A. and et al. (1990) Arthritis Rheum 33, 339-48). Furthermore the 
level of IgA rheumatoid factor is associated with the level of CDS B cells as well as 
CD4*^CD45RO* T cells (Arinbjamarson, S., Jonsson, T., Steinsson, K., Sigfusson. A., 
Jonsson, H., Geirsson, A., Thorsteinsson. J. and Valdimarsson, H. (1997) J Rheumatol 24. 
269-74). Simultaneous measurement of multiple parameters increases the probability of 
identifying key variables for segregating patient groups. 

This generic example illustrates that this invention is uniquely suited for identifying 
ensembles of biological markers to characterize diseases. The MLSC technology, which 
requires only a very small sample volume, provides that numerous assays can be completed 
on a single blood sample and ensures that the maximum amount of biological information 
can be acquired. The biological marker identification system can accommodate a mixture 
of assay types, including whole blood and RBC-lysed blood, among others. The assays 
conducted are considered relevant for the clinical indication and allow a broad survey. 
Relevant biological markers can be identified using the technology of the present 
invention. 

F-xamole Two 

Use of the present invention to identify biological markers for Multiple Sclerosis 

The present invention can be used to identify biological markers for Multiple 
Sclerosis (MS). The biological marker identification system is employed to identify 
markers for Multiple Sclerosis. MS is an autoimmune inflammatory disease of the central 
nervous system. MS is characterized clinically by relapsing and remitting episodes of 
neurologic dysfunction. The etiology of the disease remains unknown, however the 
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presence of innammator>' cells in the brain, spmal cord, and cerebrospinal Ouid impl.es 
that an immune attack against CNS myelin is central to the pathogenesis of MS. The 
hallmark of the MS lesion is an area of demyeUnaiion called a plaque that may be found 
throughout the brain and spinal cord. Inflammatory cells are seen at the edges of the 
plaque and scattered tluoughout the white matter. The main inflammatory cells include 
activated K-mphocytes and monocyte denved macrophages. CD4 T cells accumulate at the 
edges of the plaque: CDS T cells are not found as frequently in active disease, but are 
present in longstanding lesions. Autoreacttve T cells recognizing myelin basic protein and 
other non-mvelin self-antigens circulate in the blood and upon activation can pass through 
the blood-brain barrier. Up-regulatior. of adhesion molecules, histocompatability antigens, 
and other markers of lymphocyte and monocyte activation (IL2R. FcR) are all connected 
with the activation and homing process. Furthermore, there is in increase in 
proinflammatory cytokines that serves to amplify the immune response. The a^ 

response also includes pronounced B cell stimulation. The autoantibodies produced can 
activate the complement system and promote demyelination. Throughout the various 
stages of disease, there are changes in the molecules and cells in the CNS and the blood 
that have potential to be markers of disease. 

The present invention can identify disease markers of diagnostic and prognostic 
value for Multiple Sclerosis. Such markers are valuable for classifying different forms of 
the disease, for example identifying the subset of patients with relapsing-remitting disease 
who are most likely to develop those secondary progressive disease. Furthermore, the 
markers are valuable for evaluating the efficacy of mtervention and developing early, non- 
toxic and successful therapies. Many investigations have been made of cells and soluble 
factors in blood, cerebrospmal fluid (CSF) and unne that are candidate markers for the 
disease. In general, one to several markers at a time have been investigated. While some 
factors, such as oligoclonal immunoglobulin m the CSF, have been associated with MS, 
there is no consensus panel of MS-specif.c markers. There is a strong need to 
simultaneously evaluate multiple candidate markers. 

T cells . There are several lines of evidence that implicate T cells in MS. Such 
3 evidence includes the association of MS with MHC class II (particularly HLA DR) alleles 
(Hauser, S.L., Fleischnick, E., Weiner, H.L., Marcus, D.. Awdeh. Z., Yums, E.J. and Alper, 
C.A. (1989) Neurology 39, 275-7). Since CD4' T cells recognize antigen bound to MHC 
class II antigens, the association of MS with expression of specific class II molecules 
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implies a role for CD4 T cells in MS. In addition, studies in animal models of MS such as 
mouse or rat experimental allergic encephalomyelitis have shown that myelin antigen 
specific CD4 T cells can induce disease when adoptively transferred to naive animals , 
(Cross. A.H. and Raine, CS. (1990) J Neuroimmunol 28, 27-37 and Cross. A.H., Camiella. 
B.. Brosnan. C.F. and Raine, CS. (1990) Lab Invest 63, 162-70). Furthemiore, studies in 
MS patients have shown that strategies aimed at eliminating T cells or interfering with T 
cell function can slow progression of MS. 

Perhaps more relevant to the present invention, studies examining the phenotype of T 
cells, either in the cerebrospinal fluid and/or peripheral blood of MS patients have led to 
some interesting findings. There is a reduction in CD8^ T cells in the blood of MS paticms. 
The subset showing the most marked decrease was the CDS^CDl Ib^ subset (Ilonen. J., 
Surcel. H.M., Jagerroos, H.. Nurmi, T. and Reunanen, M. (1990) Acta Neurol Scand 81, 
128-30 and Oksarama, O.. Tarvonen. S.. Ilonen, J , Poikonen, K., Reunanen, M.. Panelius, 
M. and Salonen. R. (1996) Neurology 47, 1542-5). There is also an increase m activated T 
cells bearing the CD71 and CD25 markers particularly in active MS (Gene. K.. Dona, D.L. 
and Reder, A.T. (1997) J Clin Invest 99, 2664-71 and Strauss, K., Hulstaert, F., Deneys, V., 
Mazzon, A.M., Hannet, I., De Bruyere, M., Reichert,T. and Sindic, C.J. (1995) J 
Neuroimmunol 63, 133-42). Furthermore, the majority of T cells in the cerebrospinal fluid 
and peripheral blood show a memory phenotype with high levels of CD45RO and CD29 on 
both the CD4 and CDS T cell populations (Vrethem. M., Dahle, C, Ekerfelt, C. Forsberg, 
P., Danielsson. O. and Emerudh, J. (1998) Acta Neurol Scand 97. 215-20). This leads to a 
reduction in CD4XD45RA^ (Strauss, K., Hulstaert, F., Deneys. V., Mazzon, A.M.. 
Hannet, I., De Bmyere. M., Reichert. T. and Sindic, C.J. (1995) J Neuroimmunol 63, 133- 
42) and CD8*CD27-CD45RA* (Hintzen, R.Q., Fiszer, U., Fredrikson, S., Rep, M.. Polman, 
C.H., van Lier, R.A. and Link, H. (1995) J Neuroimmunol 56, 99-105) naYve T cells in the 
penpheral circulation. A recent study has concluded that CD4-. CD4^ SLAM\ and 
CD4*CD7* cells (preferentially T helper I cytokine producing cells) are increased in MS 
paliems relative to controls (Ferranle, P., Fusi. M.L., Saresella. M., Caputo, D., Biasin, M., 
Trabattoni, D., Salvaggio, A.. Clenci, E., de Vnes. J.E., Aversa, G., Cazzullo, C.L. and 
Clenci, M. (1998) J Immunol 160, 1514-21). Furthennore some studies have shown 
skewed TCR variable beta usage in the peripheral blood of MS paiiems indicative of a 
restncted TCR repertoire (Gran, B., Gestri, D., Sottini, A., Quiros Roldan, E., Bettinardi. 
A., Signorini, S., Pnmi, D., Ballenm, C, Taiuti, R.. Amaducci, L. and Massacesi. L. (1 998) 
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J Neuroimmunol S5. ::-32). A restncied pattern of gene rearrangement has also been 
described in the y5 T cell subset (Michalowska-Wender. G.. Nowak, J. and Wender, M- 

■"^ 

(1998) Folia Neuropathol 36. lo). 

B cells . Phenotypic analysis of B cells has also been performed in MS patients. A 
5 B cell subpopulation expressing the pan T cell marker CDS has been shown to be elevated 
(Mix. E.. Olsson, T.. Correale, J.. Baig, S.. Kosiulas. V.. Olsson. O. and Link, H. (1990) 
Clin Exp Immunol 79, 21-7). This subset is also elevated in autoimmune mice where they 
have been shown to constitutively express IgM autoantibodies (Hardy. R.R., Hayakawa, 
K., Shimizu, M.. Yamasaki, K. and Kishimoto, T. (1987) Science 236, Sl-3). In humans, 
10 however. CDS* B cells do not preferentially produce autobodies (Suzuki, K., Sakane, T. 
and Engleman. E.G. (1990) J Clin Invest 8S, 238-47) and the role of CDS^ B cells in the 
pathogenesis of autoimmunity in humans is still unclear, perhaps reflecting the presence of 
activated B cells (Wemer-Favre, C. Vischer. T.L., Wohlwend, D. andZubler. R.H. (19S9) 
Eur J Immunol 19, 1209-13). Consistent with this conclusion, high levels of the memory 
1 5 marker CD45RO were found on circulating CD20* B cells from patients with MS 

(Yacyshyn. B., Meddings, J., Sadowski, D. and Bowen-Yacyshyn, M.B. (1996) Dig Dis 
Sci 41, 2493-S). The number of circulating CD80* B cells is also increased significantly in 
- J MS patients with active disease, but is normal in stable MS (Gene, K.. Dona. D.L. and 

Reder, A.T. (1997) J Clin Invest 99. 2664-71). 
20 Ant;ppn- prP..enting cells . Several cell types can sen/e as antigen-presenting cells, 

including monocytes, macrophage, dendritic cells. B cells and other cells induced to 
express class II antigens. In general these cells show an activated phenotype demonstrated 
by increased expression levels of HLA class II antigens in patients with active MS (Gene. 
K., Dona, D.L. and Reder, A.T. (1997) J Clin Invest 99, 2664-71 ). A recent study has also 
25 shown that CDS6 and CD95 (fas) expressing monocytes are increased in MS as compared 
with healthy controls (Gene, K.. Dona. D.L. and Reder, A.T. (1997) J Clin Invest 99, 2664- 

71). 

Other cell types . Only a few studies have looked at the expression of NK markers 
and granulocNte markers in MS. One study shows a decrease in CD16* NK cells in 
30 chronic, progressive MS (Kastnikoff, L.F.. Morgan. N.G., Aziz. T.M.. Zecchini, D,. 
Berkowitz, J. and Paty, D.W. (1988) J Neuroimmunol 20, lS-23). 
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Soluble factor assavs . Soluble factor assays provide an additional battery of 
potential biological markers. There are many important soluble factors that have been 
identified in MS patients. For example, levels of soluble Apo A-l/Fas (Ferrante, P., Fusi. 
M.L.. Saresella, M, Caputo, D., Biasin, M, Trabattoni, D., Salvaggio, A.. Clerici, E., de 
5 Vries. J.E. Aversa, G. Cazzullo, C.L. and Clerici, M. (199S) J Immunol 160, 1514-21) is 
augmented in acute MS compared with the levels seen in patients with stable disease or 
healthy controls. In addition, levels of soluble adhesion molecules such as soluble 
intracellular adhesion molecule I (ICAM-l) (Giovannoni, G., Lai, M., Thorpe, J., Kidd, D., 
Chamoun, V., Thompson, A.J., Miller, D.H., Feldmann, M. and Thompson, E.J. (1997) 
10 Neurology 4S, 1 557-65) and soluble E -selectin (Giovannoni, G., Thorpe, J,W., Kidd, D., 
Kendall, B.E., Moseley, I.F„ Thompson, A.J., Keir, G., Miller, D.H., Feldmann, M. and 
Thompson, E.J. (1996) J Neurol Neurosurg Psychiatry 60, 20-6) have been shown to be 
increased in MS patients at different stages of disease. Proinflamatory cytokines like 
TNFa and IFNy, are known to be present at varying levels in MS patients at different 
1 5 stages of disease (Navikas, V. and Link, H. (1996) J Neurosci Res 45, 322-33). Other 
relevant proteins, such as cytokines and cytokine receptors, chemokines, matrix 
metalloproteinases and their inliibitors, neopterin, and myelin basic protein, have also been 
shown to be present at varying levels in MS patients at different stages of disease and 
healthy controls. Therefore, assays can be conducted to measure these soluble factors and 
20 look for statistical correlations with the cell populations identified. 

Medical histories and distinguishing patient populations . In addition to soluble 
factors information in the medical history of patients will be included in the database. The 
clinical history will include infomiation on age, gender, stage of disease, outside laboratory 
evidence (magnetic resonance imaging, cerebrospinal fluid analysis for oligoclonal 
25 immunoglobulin and evoked potential recordings), previous therapy and any concomitant 
dmgs or therapies. This information is relevant for segregating patient populations. 

It is evident that treatment effects play a role in the phenotype of the cells. While 
untreated MS patients display a greater population of CD3XD4"CDS* circulating T cells 
compared with healthy donors, this population of cells is reduced following corticosteroid 
30 treatment [30]. In addition, the number of CD71+ and HLA DR+ lymphocytes and 

monoc>aes is increased in active MS. However therapy with IFNp-lb reduces the number 
of activated HLADR^. CD1\-^ and CD25-^ cells. Furthermore, although the number of 
circulating CD80+ B cells is decreased, the number of CDS6+ monoc>aes is increased 
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following therapy [14]. Other clinical vanables such as d.sease duration may also be 
useful. For example it has been shown that in MS patients with restricted TCR VP 
repertoires, the median disease duration ts shorter than in pat.ents who do not have a 
restricted repenoire (Gran, B.. Gestn. D., Sottim. A.. Quiros Roldan, E., Bett.nardt, A.. 
Signonni. S., Primi. D., Ballenni. C, Taiuti, R.. Amaducci. L. and Massacesi, L. (199S) J 
Neuroimmunol S5. 22-32). 
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F.x-am ple Three 

c;...Hv rnmnarino ot^^.-mntnid Arthritis Patient to Tontrol Patient Blood 

In a pilot study, a panel of 40 two-color cellular assays was prepared for the Imagn 
2000 and evaluated blood samples from about 50 donors. Half of the samples came from 
the Stanford Blood Bank and half came from the Rlieumatology Clinic at Stanford 
University. The study was designed to evaluate and develop key components of the 
biomarker search engine of the invention: instruments, assays and analytical tools. It was 
not necessarily designed to elucidate biomarkers. All assays were done on whole blood, 
v^ithout washing to remove unbound reagents. Thirty-eight of the assays comprised 27 
different antibody reagems to 23 different cell surface antigens. Eighteen were conjugated 
to Cv5 and nine were conjugated to Cy5.5. Each of these cellular assay comprises one 
antigen conjugated to each dye to make up a two-color combination. Two assays monitored 
cell viability with a DNA intercalating dye. The cellular assays allowed us to identify 
approximately 100 different cell populations including sets of T cells, B cells, NK cells, 
monocNtes, and granulocytes. 
Methods 
Cellular assays 

The panel of assays is shown in Table 3. Each reagent is tested and titrated before 
preparing the reagent combinations in order to optimize assay performance. 
Sample preparation 

For this study all cellular assays were applied to whole blood, in homogeneous 
mode (no post stain washing). Aliquots (20 uL) of fluorescently labeled antibody reagent 
combinations of DNA dye were distributed with a multi-channel pipette from prepared 
racks into discrete wells of a microtiter plate. Whole blood or diluted whole blood (30 uL) 
v..as added with a multi-channel pipette and the sample mixed. Cells were incubated for 20 
minutes followed by the addition of 1 00 uL of diluent and mixing. A portion of each 
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Stained sample (50 uL. corresponding to 10 uL of blood) was added to volumetnc 
capillaries (VC120) and loaded into the modified Imagn 2000 instrx.ment. Scans were 
initiated and executed w.thout operator inter^.em,on. Data files are transferred over the 
computer network and convened to the Flow Cytometry Standard format. FlowJo 
5 cytometn- software was used to identify cell populations and obtain numerical values for 
cell counts (cells per uL), relative cell size, and relative fluorescence intensity, which ,s an 
estimate of the antigen density for each gated (boxed) cell population. 
"soluble factors 

Serum levels of C-reactive protein were measured on the Imagn 2000 with abead- 
10 based immunoassay. Beads coated with anti-CRP antibody were used to capture the 
analyte. Cy5 conjugated anti-CRP antibody was used reveal the captured analyte. 

Patient medical information 

An abbreviated medical history, including age, gender, parameters of disease 
severity, co-morbidities and medications was obtained fi-om each patient. Data for the 
1 5 blood bank samples was limited to age and gender. 

Database and statistics 

Data output from the cellular assays, soluble factor assay and medical histories 
were combined into a single database. To identify potential biological markers (cell counts 
or staining intensity, serum concentration) associated with categorical clinical variables 
20 (such as diagnosis of disease) a variety of discnminant techniques was used including 
Fisher linear and quadratic discriminant analysis, logistic regression, and classification 
trees. To identify markers associated with continuous-valued clinical variables (such as 
erythrocyte sedimentation rate) we use a variety of regression techniques including 
multivariate linear regression and regression trees. For both d.scnminant and regression 
25 analyses, stepwise variable selection and cross-validation was used to identify those 
markers that are most closely associated with the clinical vanable of interest. Where 
appropriate, demographic and clinical variables (such as age. gender, and concomitant 
dnigs) were included as covariates in the models. These techniques were implemented and 
applied using the SAS and Statistica statistical analysis software packages. 
30 Results 

rnmmnn analvsis srraipaies can h e u.sed for most patient samples 

Although one of the drawbacks of cell population studies is often the variability 
among donor samples, identical analysis windows (gates) were used across all donors for 
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95% of ihe cell populations analyzed. This demonstrates the robustness and consistency of 
these assays and cell analysis systems. The remaining 5% of the gates were adjusted to 
account for new populations appeanng in certain donors or a reagent that appeared 
unreliable for a few donors. In the latter case the problem reagent can be replaced with an 

5 improved version in future studies. 

An example of a 2-color combination with variation among donors is shown in 
Figure 4. The cells were stained with CD27 conjugated to Cy5 in combination with CDS 
conjugated to Cy5.5. This combination allowed CDS' T cells (MHC class 1 restricted) to 
be momtored. which arc CD27* (activated) and CD27-. CDS", CD27* cells (which are 
10 actually activated CD4, MHC class II restricted, T cells) are also detected. Although there 
is variation among the donors, a single gating strategy can be implemented. Three cell 
populations are identified which differ among the donors. In Figure 4A the majority of 
CDS* T cells are CD27 negative. In Figure 4B the majority of CDS* cells are CD27 
positive. Finally, in Figure 4C. the CDS population is split between those that are CD27 
1 5 positive and those that are CD27 negative. FlowJo, our cytometry software, calculates the 
cell count for each of the gated populations. In addition, the mean fluorescence intensity 
for each antigen was obtained. This is indicative of the antigen density on the cell surface. 
^ The relative cell size for each cell population was also obtained. The numbers were 

compared and compute statistics across all donors. The differences shown here with 
20 respect to CD27 expression on CDS* cells are typical of the kinds of changes that are 
observed when comparing patient and control populations in our clinical study. 
Excellent correlation among related measure ments 

Another goal of this imtial study was to assess the robustness of the 2-color Imagn 
system and develop statistical tools. The study was designed so that several capillaries 
25 contained the same antibody reagent conjugated to either the same or the alternative dye. 
This allowed the same measurement to be obtained any^vhere from 2-6 times from tiie 
same donor for CD3. CD4. CDS. CD7. CDS, CD19, CD20 and CD27 amigens. The same 
cell populations were also measured using antibodies to different antigens found on ihem. 
For example, total T cells were enumerated using CD3 as well as CDS. B cells were 
30 enumerated using CD 1 9 as well as CD20, etc. In this preliminary study, excellent 
consistency was seen both between capillaries comaimng the identical reagent and 
capiUanes containing different antibodies staining similar cell populations. Correlation 
coefficients for the same antigen across different capillaries averaged 0.94. The correlation 
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coefficient was 0.97 for bo.h CD3 vs. CDS and CD 1 9 vs. CD20. Examples of these 

correlations are given in Figure 5 and Figure 6. 

r>;ffPrPnre.. are o^— -".nr^9 RA an d hlood hank samples 

Several measured parameters were used to segregate general blood bank samples 
and RA patient samples as shown .n Table 4. The best single markers accurately segregate 
80 to 86% of the sample (7 to 10 mcorrect assigru.nents). Some, two cell population pairs 
segregate 90% of the samples, suggesting that sets of cell populations may be more useful 
than single cell populations for segregating patient populations. 



10 F.vam ple Four 

Pv pandpd RA Study 

' This Example expands the measurement capabilities in an RA study. Cell 
populations and soluble factors from rheumatoid arthritis (RA) patients were monitored. 
The RA patients were part of a clinic study, receiving methotrexate and either ARAV A or a 
1 5 placebo. Patients were momtored longitudinally over about 2 months. At each time point, 
cell population data, soluble factor data, and clinical information was collected. 
relluiar assays 

Most of the cellular assays are done in whole-blood format as described in Example 
three The minimal manipulation ensures that the most accurate absolute cell counts 
.0 (cells/Ml of blood) are obtained. Furthem^ore only small amounts of blood are required per 
assav (40 ul) so that many assays can be run from a single tube of blood. However, for 
some cell populations an alternative assay format, RBC-lysed blood, is preferable. These 
include particular antigen-antibody pairs for which soluble factors (free Ig, soluble 
cytokine receptors, etc.) contained in the sera interfere with cell labeling and populations of 
.5 cells that are present in very low frequency. For example the RBC-lysed sample 

preparation is useful for activated cells expressing CD25 or CD69 which are essentially 
undetectable in whole blood from nomnal individuals but are increased ten-fold in the lysed 
fomiat and are likely to be increased in various autoimmune states. Improved detection of 
other minor cell populations such as NK cells has also been demonstrated. 
30 For this protocol, the cellular assays included a panel of 60 2-color combinations 

compnsing 46 whole blood assays and 14 RBC-lysed whole blood. A total of 39 different 
antibody reagents (30 conjugated to Cy5 and 9 conjugated to Cy5.5), targeting 35 distinct 
cell surface antigens, were used. All assays are done in homogeneous mode (no wash after 
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staining). This assay panel enables the identificaiion of more than 150 different cell 
populations. The reagent combinations and the cell populations that can be identified are 
provided in Table 5. 
Soluble Factor Assays 

5 Sera are aliquoted and frozen for each blood sample for subsequent nieasuremenl of 

multiple soluble factors. These include levels of circulating cytokines such as TNFa and 
lL-6, c>^okine receptors, chemokines, rheumatoid factors (RF) of different isotypes, 
immunoglobulin, acute-phase proteins such as C-reactive protein and serum amyloid A, 
and soluble adhesion molecules, as well as matrix metalloproteinases and their inhibitors. 

10 The initial panel of 22 soluble factors assayed is shown in Table 6. Additional targets are 
also provided in Table 6. All assays are done in a sandwich ELIS A format using matched 
antibody pairs to ensure the required sensitivity and specificity. 
Patient medical information " 

A medical history with more detailed disease-specific infonmation is included with 

1 5 each sample in the study. 
Example Five 

Cellular Assays on a 4-channel MLSC Instumcnt 

More assays, with greater infomiation content per assay, can be run on the 4- 

20 channel SurroScan instrument. Assays are developed using 3 color reagent combinations. 
Effective dye combinations include Cy5, Cy5.5 and Cy7 and CyS, Cy5.5 and Cy7-APC 
allow simultaneous and independent measurment of three target antigens. Three color 
combinations facilitate the acquisition of more information per capillary than 2 color 
combinations by 1) eliminating redundancy (e.g. measuring CD3, CD4 and CDS in one 

25 capillary instead of measuring CD3 + CD4 and CD3 + CDS m nvo capillaries) and 2) 
identifing new populaitons that are defined by the simultaneous expression of 3 antigens 
(e.g. naive CD4" T cells that express both CD45RA and CD62L). Given appropriate 
fluorescent dyes with distinct emission spectra, it is possible to simultaneously monitor 
additional target antigens either in the fourth channel, or in some cases, in the existing 

30 channels. Figure 7 provides the results of a 3-color assay on the SurroScan instrument. 

Assays on the SurroScan instrument can be executed with capillary arrays which 
use about 1/3 less sample than the VC120 capillaries. For whole blood assays it is 
possible to process 10 uL or less per 3 color assay, giving the potential for up to 1000 
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assays per 10 mL tube of blood. For RBC-lysed blood samples whh a 10-fold increase in 
leukocvie concentration, about 100 assays could be done per tube of blood. A panel of 64 
3-color assays with 50 or more target antigens is under development using both whole 
blood and lysed formats. It should allow idenlif.cation of more than 200 cell populations. 

5 

Fxam ple Six 
Tntrarpllular Staining 

Intracellular molecules can be measured with MLSC technology. PBMC were 
cultured for 5 hours in the presence of PHA and ionomycin. Cells were stained with Cy5.5 
10 anti-CDS to identify cytoxic T cells, fixed, permiablized, and stained with Cy5 anti- 

merferon-gamma (IFN-y) to detect the intracellular cytokine. Data in Figure 8 shows that 
IFN-y is detected only in stimulated cells. A control reagent (MOPC) does not label the 
cells. Among the CDS T cells, 20 % express intracellular IFN-y. 



20 



15 F-xample Seven 

T^^nHfyina Riolnpical M ^rVpr. for the Treatment of Alle rp y and Asthsma 

The present invention can be used to identify biological markers for allergic 
asthma. Asthma is common chronic lung disease of uncertain etiology. It is characterized 
by innammation of the airways leading to symptoms of coughing, wheezing, chest 
tightness, and shortness of breath. These clinical symptoms are thought to be due to hyper- 
responsiveness of the airways and a long-term infiammatory process causing obstruction of 
airflow. The disease causes extreme discomfort and can at times be fatal in the absence of 
appropnate treatment. The clinical manifestations of asthma are thought to result from the 
superimposilion of a vanety of environmental factors on genetic predispositions that 
15 increase the likelihood of developing asthma. Atopy, the hypersensitivity to enviromnental 
allergens, is common in asthma, but not all atopic individuals develop asthma. The relative 
importance of allergic mechanisms is not completely understood. Corticosteroids (inhaled 
and systemic) are efficacious in asthma but have associated with perceived and real side 
■ effects that limit their usefulness. A more complete understanding of response to 
30 corticosteroids might allow for the development of drugs with only local effects within the 
lungs or drugs that have beneficial effects without side effects. 
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A study has been designed lo identify biological markers of atopy, asthma and the 
response to corticosteroid therapy. Subjects are screened for four study groups oflO: 1) 
n.ld asthmatics who have tested pos.ttve to skm test allergens. 2) mild asthnrtattcs who 
have tested negative to skin test allergens. 3) non-asthmatics who have tested pos.t.ve to 
sktn test allereens. and 4) non-asthaT.atics who have tested negative to skin test allergens 
(healthy subjects). All elig.ble subjects are entered onto a single-blinded, placebo 
controlled, randomised parallel study to investigate the effect of the drug prednisone on 
biological markers after 3 days bid treatment. Blood samples are taken pnor to treatment 
on the morning of day 1 and 12 hours after the last dosing on the morning of day 4. 

Subjects undergo ngorous screening includ.ng deta.led medical history and cl.ntcat 
tests for lung function and allergy. Mtld asthmatics have a 1) FEV, > 80% pred.cted, 2) 
documented d.agnos.s of asthma or history of any of the following: cough, worse 
particularly at night, recurrent wheeze, recurrent diff.cuU breathing, recurrent chest 
tightness and 3) a positive methacholine challenge test (Cockcroft DW. et al Clin Allergy 
1977- 7:235 and Juniper EF, et al Thorax 1984; 39:556). Non asthir^atics have a 1) FEV. > 
80O/O predicted 2) no history of asthma and 3) a negative methacholine challenge test. 
Allergic subjects have a positive skin test to at least oneof a panel of allergens. 

Examples of clinical data include Haematology: white blood cell count (WBC), red 
blood cell count (RBC), hemoglobin (Hb). hematocrit (HCT), mean cell volume (MCV), 
mean cell haemoglobin (MCH), mean cell haemoglob.n concentration (MCHC) platelet 
count neutrophil count lymphocyte count, monoo.e count, eosinophil count, basophil 
count and ESR - erythrocyte sedimentation rate; blood biochemistry: alkaline phosphatase, 
alanine transaminase, aspartate transaminase, gamma-glutamyl transpeptidase, albumm, 
total protein, total bilirubin, urea, creatinine, sodmm, potassium, glucose; unnalysis: 
protein glucose, ketones, bilirubin, blood, leucocytes; Hepatitis and HIV testing: HIV I 
and II Hepatitis B surface antigen. Hepatitis C antibody. All clinical history and test 
parameters will be included in the master database for statistical analysis, evaluation as 

covariates and data mining. 

Atopic asthma is an immunologic disease mediated by IgE antibodies. Exposure to 
D alleroen causes B cells to synthesize IgE, wh.ch binds to the high affinity receptor mast 
cells residmg in the mucosa of the airways. On re-exposure to the allergen, am.gen- 
annbody interactions on the surface of the mas, cells triggers release of mediators of 
anaphvlaxis stored in mast cell granules, includ.ng: histamine, tryptase, PGD:. leukotnene 
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C. and D. md platelet activating factor (P AF). These soluble factors induce contraction of 
air smooth muscle and cause an immediate fall in the FEV, Re-exposure to allergens also 
leads to the synthesis and release of a vanety of cytokines: IL-4, IL-5, GM-CSF, TNF-a, 
TGF-P, from T cells and mast cells. These cytokines attract and activate B cells, which 
leads to the production of more IgE, and eosinophils and neutrophils, which produce 
eosinophil cationic protein (ECP), major basic protein (MB?) and PAF. These factors 
cause edema, mucus hypersecretion, smooth muscle contraction and increase the bronchial 
reactivity that is typically associated with the late asthmatic response, indicated by a fall in 

FEVi about 4-6 hours after exposure. 

A broad panel of cellular and soluble factor measurements are applied to the subject 
blood samples with the goal of discovery biomarkers. The study design supplies 
mformation of inter-individual variability within groups, and inter-group differences in 
marker expression. It is believed that the inter-group differences (e.g. allergic non- 
asthmattc versus non-allergic non-asthmatic) will be greater than inter-individual 
variability within groups. It is further believed that prednisone therapy will result in 
significant intra-individual changes in marker expression. 
Cellular assays 

A panel of 64 three color cellular assay, focusing on immune and inflammatory 
parameters in the blood, has been prepared and tested for initial atopic asthma. The panel 

IS given in Table 7. 
!snliihle Factors 

The study will also look at a broad panel of soluble factors. Immunoassays, in the 
sandNvkh-based chemiluminescent ELISA format, are used for the following targets: 

Cytokines, chemokines and their soluble receptors: IL-1 alpha, lL-1 beta. IL-1 RA, 
IL-1 sRJ, IL-1 sRIl, IL-2. IL-2sR, IL-3IL-4, IL-5, IL-6. IL-6 sR. IL-8. IL-10, lL-12 p40, 
IL-12 p70, IL-l 3, IL-16, IL-17, MIF, MIP-1 alpha, MIP-1 beta, RANTES, sTNFalpha RI 
*, sTNFalpha RII *. TGF beta, TNF alpha, alpha, TGF beta2, TGF beta3, Oncostatin M. 
M-CSF, GM-CSF, IGF-1, PDGF-BB. FGF-4, FGF-6. FGF-7. Fas, VEGF, MCP-1. PF-4. 
EOTAXTN, IFN gamma. Immunoglobulin: IgAl Kappa, IgAl Lambda, IgAl. 2 Kappa, 
) IgAl, 2 Lambda, IgA2 Kappa, IgA2 Lambda, IgE total, IgGl Kappa, IgGl Lambda, IgGl 
total, IgG2 Kappa, IgG2 Lambda, lgG2 total, lgG3 Kappa, IgG3 Lambda, IgG3 total, IgG4 
Kappa, IgG4 Lambda, IgG4 total, IgG total , IgG total Kappa, IgG total Lambda. IgM 
Kappa, IgM Lambda, IgM total. RFIgA. RFIgG, RFIgM, RF total. Acute phase proteins: 
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CRP, SAA; Matrix metalloproteinases and their inhibitors: N4MP-3. MMP-9, TIMP-1, 
TIM?-:-. Soluble adhesion molecules: sCD54 (ICAM-1), sCD62E, sCD62P. 

Additional soluble factors which are measured by immunoassays or mass 
spectroscopy assay include, but are not limited to. Cytokines, chemokmes and their soluble 
5 receptors: lL-9. IL-U, lL-14. lL-15, lL-18. sCD23, eosinophil proteins: EC?. MBP. 
Immunoglobulin: Allergen specific IgE, carbohydrate modified Ig; a variety of 
prostaglandins; a variety of leukotrienes, histamine. 

Data output from the cellular assays, soluble factor assays, medical histories and 
screening labels are combined into a single database. To identify potential biological 
10 markers (cell counts, antigen intensity on particular cell types, soluble factor 

concentrations, etc) associated with categorical clinical variables (disease status, 
prednisone or placebo, before or after therapy) a variety of ANOVA and discriminant 
techniques can be used. Where appropriate, demographic and clinical variables (.such as 
age. gender, specific history results) can be included as covariates in the models. 
1 5 Techniques can be implemented with S AS, Statistica, Statv.ew or similar statistical 
analysis software packages. 

F.xample Eight 

T Tc. .h. pr..Pn, invention t o iH^ntifv hinlQgical markers following the administration of 

20 aspirin 

The present invention can be used to idemify biological markers for evaluating the 
effects of dmg administration on cellular and soluble factors to be performed on small 
samples of peripheral blood. It is expected that these assays will make possible analysis of 
the effects of different doses of drugs on cellular and soluble markers in human peripheral 
25 blood. In this example the widely used over-the-counter drug, aspirin (acetylsalicylic acid), 
is administered to human volunteers. Different doses of the drug will be orally 
administered; blood is drawn before and at various time points after administration, and 
panels of cellular and soluble factor assays are undertaken. The aspinn is expected to 
cause changes in the cellular and soluble components of blood. 
30 Aspirin is routinely used for two main indications: 1 ) to reduce the risk of coronary 

and cerebral thrombosis and 2) as an analgesic/anti-innammator>' agent. The mechanism 
underlying the first indication is believed to be irreversible inhibition of the enzyme PGH- 
synthatase in platelets. A prostaglandin product of this enzyme in platelets is converted to 
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thromboxane A2. which facilitates platelet aggregation and thrombosis. A side effect of 
prostaglandin synthesis .s the generation of oxygen free rad.cals. which in the presence of 
redox-oxidattve metals convert unsaturated fatty acids .nto aldehydes. A relatively stable 
product of lipid oxidat.on is malondialdehyde (MDA). Th.s compound is routinely assayed 
5 colorimetencally or nuorometncally followmg interaction with th.obarbituric ac.d (TBA. 
Aspirin, by inh.bitmg prostaglandin synthesis, is expected to decrease MDA levels m 
peripheral blood platelets. Th.s is one parameter that is expected to change followmg 
aspirin admimstration. Changes in other markers of platelet activation such as changes m 
the expression of CD62P and CD63 may also occur. 
10 E-type prostaglandins suppress lymphocyte activation and the production of tumor 

necrosis factor-a (TNF-a) by the cells of the monocyte-macrophage lineage. If there is 
some level of lymphocyte activation and TNF-a production in normal healthy persons, this 
may be increased after aspmn treatment and detectable in the peripheral blood. These are 
examples of expected changes following aspirin administration; if many markers are 
1 5 assayed, unexpected changes may also be found, and may prove to be more interesting than 
those expected. 

The study is designed to identify the effects of aspirin on blood parameters. Eligible 
subjects are randomly assigned to orally administer asp.nn according to one of three dosing 
schemes. Group I, 1 dose (325 mg tablet) after breakfast. Group II. 2 doses (650 mg) after 
20 breakfast and Group III. 2 doses after breakfast and 2 doses after diner (1300 mg total). 
There are 10-12 subjects per cohort. Blood samples are taken before, during and after 
aspinn admimstration. The schedule is given in Table 8. Subjects are healthy individuals 
age 1 8-65 who are not taking other asp.nn other non-steroidal anti-inflammatory drugs nor 
cunently under care, which requires the use of anti-inflammatory (steroidal or non- 

25 steroidal) drugs. 
rellular assavs 

A panel of 42 three-color cellular assays are used for the initial study see Table 9. 
The panel includes immune and inflammatory parameters and contains some of the assays 
_ listed in Example 7. It also includes a series of assays for platelet function (1-17). These 
30 assays include direct measurements in diluted whole blood (WB,- 1-9) as well as thrombin 
stimulation assays (TRT, 10-13) and stimulation controls (NTRT, 14-17). 
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jsnluhle Factors 



A broad panel of soluble facors as described in Example 7 will be part of the study. 
Addu,onal measurements mclude : Von Willebrand factor. b-Thromboglobulin, 
Thromboxane B2. 6-keto PGF and malondialdehyde. Soluble factors will be measured 
from plasma. In addition, some soluble factors (e.g MDA. prostagland.ns leukotnenes) 
will be evaluated for the stimulated samples and controls. 
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TABLE 1 



j Major Markers 


Subsettins antigens 


T Cell: 

CD:.CD3.CD4. CD5-CD7. 
CDS 


Memorv/Activation/Co-stimulation: CD6. j 
CD25, CD26. CD27, CD2S, CD3S, CD43, 
CD4?RA/RO. CD49a-l, LUoV, (.u iv.^u / 1. i 
CD80, CDS6. CDl 52(CTLA4). 

CD154(CD40U 


Pb Cell: 
CDI9,CD20.CD2LCD22, 

CD23, CD72 


Adhesion: CDUa/b/c(niicgnns), UU-^', 
CD31, 

CD44, CD54(ICAM-1). CD5S(LFA3), 
CD62E/L/P{seleciins), CDl02(lCAM-2), 
CD104,CD138 


1 Antigen-presenting Cell: 
CD13,CD14.CD15. CD33 


Antigen receptors: 

TCR: aPTcR, ySTcR, speciTic TcR vp panel l 
Sis: IgM. IgG. la A 


NK ceil: 

CD16.CD56.CD57.NKBl 


FcR: CD16(FcyRIII), CD32(FcyR11). 
CD64(FcYRn 


1 Granulocyte: 

CD13,CDI5. CD16, CD33 


Cytokine receptors: CD25/CD122(IL2R). 
cb95(Fas).CD116(GN4CSFR). 1 
CDwl 19(IFNyR), CD120(TNFR), 1 
CD121aA)(ILlR), CD123(IL3R), 
CD124(IL4R), CDwl25(IL5R), CD126(IL6R), 
CD127(IL7R), CDwl28(IL8R) 




Nonlineage: CD9, CD35. CD40, CD45, HLA 
class II DR. DP, DQ, PAN. CDwl 50(SLAM) | 


*Some cell surface antigens are in more than one category. 
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TABLE 2 



Dye Type 



Cvaninc Dves 



Bodipy* 



Plivcobiliproieins 

1 — 



Tandem Dyes" 



PEG Stabilized- 



Microspheres' 



Dyes (Excitation/Kmission maximum) 
r» C^^l^;^ i 2"^ Color Color 



Cv5 (650/667) 



BODIPV 630/650-X 



APC (652/660) 



Cv5-5 (678703} | Cy7 

(743/770) 



BODIPV 650/665-X 



; Cy7-APC 
i (652/780) 



La JoliaBlue 
(680/705) 



Scarlet (645/680) 



Dark Red (660/680) 



Far Red 
(690/720) 

Infrared 
(715/755) 
Transfluor 
(633/720) 



Amersham, -Molecular Probes, -^Multiple vendors, 'PharMingen REF ' Diatron 
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TABLE 3 

Assav Panel 



Potential populations detected 



Comments 



C ombo I Cy:> \ ^y^-^ ^ — , . . 

VI V,.. T cpII .i.h.rt<; (seco ndarily monocytes based on CD 4, NK based on CD7) 

■ J ^ ■ - — . ^ ^r.,^_- ' T--..> r-r^A -InPm-SnOl. 



CD3/SR0 i CD4/SR05 



54 



1 CD3/SR0 
54 



Total CD4 
Total CD3 . 
i CD3-4+ (CD4T) 
CD3-r4-(CDST) 
CD3-4+ (mono) 



CDS/SR12 



CD27/SR 
162 



CD4/SR05 



Total CD3 
Total CDS 
CD3+8+ (CDST) 
CD3+8- (CD4 T) 
CD3-8+ 



CD27/SR 
162 



CD7/SR1 
29 



CD8/SR12 
3 



CD4/SR05 
1 



CD7/SRI 
29 



CD8/SR12 



Total CD27 
Total CD4 
CD27-4+ (CD4 T) 
CD27-^4- (CDST) 
CP27-4+ (mono) 



Total CD27 
Total CDS 
CD27+8+ (CDS T) 
CD27+8- {CD4 T) 
CD27-8+ 
Total CD7 
Total CD4 
CD7+4+ (CD4 T) 
CD7+4- (NK + 8) 
CD7-4+ (mono) 



CD5/SR0 
52 



CD7/SR13 
6 



Total CD7 
Total CDS 
CD7+8+ (CDST) 
CD7+8- (NK + 4J 
CD7-8-^ 



Total CD7 
Total CD5 
CD7+5+ (T) 
CD7-r5- (NK) 
CD7-5+ 



i apTCRyS 
' R0S9 



CD7/SR13 
6 



Total CD7 
Total aPTCR 
CD7+aPTCR+ (T) 
CD7-i'aPTCR-(NK) 
CD7-HaPTCR^ 



In Pro-5001, 
1 (5 new pops) 



In FrooOOU 
(4 new pops) 



NewCD27 
(4 new pops) 



New CD27 
(3 new pops) 



New CD7 
(4 new pops) 



New CD7 
(3 new pops) 



New CD7 
(4 new pops) 



New aPTCR 
(4 new pops) 



1 9 


CD45R.A.' 
SR181 


CD4/SR05 
1 


CD4+45R.A- 
CD4+45RA- 


In Pro- 5 00! 
(1 new pops) 


1 
1 

( 


10 

i 


CD45R.\ 
SRlSl 


: CDS/SRI 2 


1 CDS-45RA- 
! CD8-45R.^-^ 


In Pro-5001 | 
i (2 new pops) 
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.•.r«'?^, 



n 



Combo U 



TABLE 3 (CONTINUED) 



CD62L S i CD4'SR04 

R09S ' 1 

CbS/SR12 



! 12 


CD62L.S 


i 


R09S 


1 ij 


CD69 SR 




1 099 


14 


CD69/SR 




099 


15 


) CD25/SR 




I 095 



CD4/SR05 



CDS/SRI 2 



CD4+62L- 
CD4+62L7 
CDS+62L- 
CDS+62L+ 
CD4+69- 
CD4+69 



CD8+69- 
CDS+69- 



/// FrO'500} 
(2 new pops) 
fn Pro-5001 
(2 new pops) 
In Pro-5001 
(2 new pops) 



fn Pro-5001 
(2 new pops) 




New CD25 
(2 new pops) 



CvS 



I Potential populations 
detected bv this combination 





t 


ot stains — ' 


16 


CD5/SR0 

t 
1 


CD19/SR0 
50 


Total CD5- 

Total CD19-i- 

CD^■^-l9+ (CD5+ B cells) 


In Fro-50Ul 
(2 new pops) 

i 


17 

1 / 


CD25ySR 
095 


CD19/SR0 
50 


CD19+25- 
CDl9+25^ 


(2 new pops) i 


18 


CD69/SR 
099 


CD19/SR0 
50 


CD 1 9+69- 
CD19+69+ 


In Pro-5001 
(2 new pops) 


19 


CD80/SR 
101 


CD19/SR0 
50 


CD 19+80- 

CU1V+OU+ 


New CDSO 
f 2 new pops) 


20 


CD86/SR 
143 


CD19/SR0 
50 


CD19+86- 
CD19+86+ 


New CDS6 
(1 new pops) 


21 


CD62L/S 
R09S 


CD20/SR1 
60 


Total CD20+ 

CD20+62L- 

CD20+62L+ 


Replace CD 19 
(3 new pops) 


22 


CD45RA/ 
SR181 


CD20/SR1 
60 


CD20+45RA- 
CD20+45RA+ 


Replace CD 19 
(2 new pops) 


■ 


Cv5 


Monocyte 
Cy5.5 

1 


f subsets (also B, CD4 T) 
Potential populations 
detected by this combination 
of stains 


Comments 


23 


HLA 

PAK/SRl 

51 


CD20/SR1 
60 

1 

( 

i 


Total PAN 11- 
Toial CD20+ 
PAN 11+20+ (B) 
PAN n+20- (mono) 


New PAN 11 
(3 new pops) 


24 


HLA 1 CD20/SR1 
2DR/SR1 \ 60 
i 47 i 

i i 


Total DR- 
Total CD20+ 
DR+20- (B) 
DR+20- (mono) 


New DR 
(3 new pops) 


25 

1 


HLA ; CD4/SR05 
PAN'/SRl 1 1 

1 

t 


Total PAN 11- 
Total 

\ PAN n-^4-^ (mono) 
PAN il-4-(B) 
PAN il-4'^(T) 


New PAN II 
, (3 new pops) 

• 
J 

1 
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: 26 



' HLA 2 



TABLE 3 (CONTINUED) 

I CD4.'SR05 i Total DR- 



New DR 





DR-'SRU ' 
7 


1 

1 


Total CD4- 
DR+4-^ (mono) 
DR+4- (B) 
DR-1- (T) 


(3 new pops) ! 


27 

f 


CD33/SR ; 
094 j 


CD4/SR05 
1 

1 


Total CD4+ 
Total CD33-H 
CD33^4+ (mono) 
CD33-4+ (CD4 T) 


New wav to detect 
mono 

(3 new pops) 


i 2S ' 

♦ 

1 


CD14/SR 
179 

1 
1 


CD4/SR05 
1 


Total CD4-f 
Total CD 1 4+ 
CD4+14+ (mono) 


New CD 1 4 : 
(2 new pops) 


29 

■ t 


CD14/SR 
179 


CD3/SR05 
5 1 


Total CD14- 
Total CD3- 


Confirmation of C U3 
and CDl 4 subsets (no 
new) 


30 1 


CDSO/SR 
101 


CD33/SR1 
06 


Toial CD33- 

CD33-t-80- 

CD33+80- 


(2 new pops) 


31 

1 

1 


CDS6/SR 
143 


CD33/SRI 
06 


Total CD33+ 

CD33+S6- 

CD33+86+ 


(2 new pops) 


32 


CD45RAy 
SR181 


CD33/SR1 
06 


Total CD33+ 

CD33+45Rj\- 

CD33+45RA+ 


(2 new pops) 


33 . . 


CD62LyS 
R09S 


CD33/SRi 
06 


Total CD33+ 

CD33+62L- 

CD33+62L+ 


(2 new pops) 






Granulocyte subsets 




Combo 

n 


Cv5 


CyS.S 


Potential populations 
detected by this combination 
of stains 


Comments 


34 

i 


CD16/SR 
065 

i 


CD45/SRI 
39 

t 


Total CD45+ (total wbc) 
Total CD 16+ 
Large, small CD45-t-l6- 
Larpe. small CD45-^16- 


In Pro-5001 
(6 new pops) 

t 


35 


CD16/SR 
065 

! 


CDllb/SR 
070 


TotalCDllb+ 

Total CD 1 6+ 

Large, small CD 16+ 11b- 

Larue. small CD16-llb- 


in Pro-5001 
(4 new pops) 


36 


CD62L/S 
R098 


CD45;SR1 
39 


! 'Total CD45 
Laree. small CD45+62L- 
Laree. small CD45-62L- 


New gran combo 

(4 new pops) i 


J / 


CDllb/S 
R102 


CD45/SR1 1 Total CD45 
39 ; Large, small CD45+1 lb- 
! Larce. small CD45+1 lb- 


New gran combo 

(4 new pops) 1 


1 3S 

t 
1 


CD45RB/ j CD4.SR05 
SR144 ! 1 

! i 


Toial CD4- 1 New CD45HJ3 
Total CD45RB+ ■ (3 new pops ') 
CD4+45RB+ i 
CD4-45RB- 1 
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TABLE 3 (CONTINUED) 



j 1 new pop - 
! total cells 
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TABLE 4 

Pilot study- linear discriminant analysis 

for distinguishing R.\ and Blood Bank samples in data set 

Samples, n=5l. Blood bank = 26, R.A = 25 



Incorrect 

sample 
assignment 



10 



II 



12 



13 



14 



Single Marker 



Single Markers 



T CR-aP T cells as a % of leukocytes 
CD7 cells as a % of leukocytes 
CD3 cells as a % oFleukocxtes 



CD16 cells as a % of leukocytes 
CD45 cells (all leukocytes) 
CDS T cells as a % of leukocytes 
C D20 intensity on B cells 
Marker Pairs 



CD4 T cells as a % of leukocytes 
CDS T cells as a % of leukocytes 



CD20 intensity on B cells 
CD7 cells as a % of leukocytes 



CD45 cells (all leukocytes) 
CDS T cells as a % of leukocytes 



CD20 T cells as a % of leukocytes 

CD7 T cells as a % of leukocytes 

Most measurements are averages from 2 to 6 assays 
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TABLE 5 



Information on reagent combinations in panel for Pro-5003 



004 



005 



006 



007 



CD2/SR306 I CDS/SR212 



CD2-4- 
CD2-4- 



CD3/SR054 I CDS/SR212 



CD7/SR20S 



4 populations 
CD2+S+brighl 
CD2+8+dulI 
CD2-^8+total 
CD2-f8- 
3 populations 
CD3+4+ 
CD3+4- 
I CD3-4+ 
3 populations 
CD3+8+ 
CD3+8- 
CD3-8+ 



CD4/SR051 



CD7/SR2G8 CDS/SR212 



3 populations 
CD7+4+ 
CD7+4- 
CD7-4+ 



3 populations 
CD7+8+ 
CD7+8- 
CD7-8+ 



apTCR/SRO j CD7/SR211 
89 



OOS CD27/SR162 



CD4/SR051 



5 populations 

ap-^7+ 

ap-7+brighl 

aP-7+dull 

aP-7^total 

aP-i-7- 

3 populations 
CD27-f4- 
CD27-4- 
CD27-4- 



rNo. 


Cv5 


Cv5.5 1 


Populations 


Comnients 


i 001 


CD2.'SR306 


CD4/SR051 


3 populations 


■ ; 






CD2-4- 


( 
1 




Pro-5002 



Pro-5002 



Pro-5002 



Proo002 




Pro-5002 



009 ; CD27/SR162 



CDS/SR212 



I 7 populations 
j CD27-H8+brigln 
CD27-^8+dun 
CD27-rS-lotal 
CD27+8- 
CD27-8-brighi 
CD27-S^dull 
CD27-S+ioial 



Pro-5002 
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TABLE 5 (CONTINUED) 




01 1 



CD6/SR364 ICDS/SR212 



lOlO ■CD6/SR364 j CD4/SR051 \ 3 populations 

CD6-r4- 
CD6-H4+ 
CD6-4- 
7 populations 
CDG+S+brighi 
CD6+S+dun 
CD6+8+total 
CD6+S- 
CD6-S-i-brighl 
CD6-8+dull 
CD6-S+total 
3 populations 
CD26+4. 
CD26+4+ 
CD26-4+ 
5 populations 
CD26+S- 
CD26+8+ 
CD26-8+bright 
CD26-8-i-dull 
CD26-8+total 
2 populations 
CDS 7+4+ 
CD57-4+ 



012 iCD26/SR363 



CD4/SR051 



I 



013 



CD26/SR363 1 CDS/SR212 



014 



015 



CD57/SR197 



CD4/SR051 



CD57/SR197 



CD8/SR212 



6 populations 

CD57+8+bright 

CD57+8+dull 

CD57+8+total 

CD57-8+brighi 

CD57-8+dull 

CD57+S+total 



016 



NICBI/SR37 
5 



CD6/SR362 



2 populations 

NKBI+6+ 

NKBI-6+ 



017 



CD45RA/SR I CD4/SR051 
346 



3 populations 
CD45RA-4^ 
CD45RA+4-r 
CD45RA+4- 



OlS 



CD45R.VSR i CDS/SR212 
346 ! 



019 



3 populations 
CD45RA-S+ 
CD45RA-S- 
CD45RA+8- 



CD62L/SR2 
' 27 



CD4/SR051 



3 populations 
CD62L+4+bright 
CD62L-4+bright 
CD62L-^4+dull 



Pro-5002 



i Proo002 



Pro-5002 
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I 0:0 i CD62LSR2 



TAB LE 

CDS/SR212 



024 



025 



026 



027 



028 



1 



69 

5 (CONTINUED) 
6 populaiions 
CD62L-S-rbrighi 
i CD62L+S+dull 
' CD62L-^S+ioial 
CD62L-S^bright 
CD62L-S+dull 
CD62L-S-Motal 



Pro-5002 



CD69/SR099 



CD4/SR05i 



023 ICD25/SR231 



CD4/SR051 



2 populations 
CD69+4+ 
CD69-4-f 
2 populations 
CD69+S- 
CD69-S-H 
2 populations 
CD25+4-^ 
CD25-4+ 



Use Ivsed (5n) 



; Use lysed (5x) 



CD25/SR231 CD8/SR212 




TCR- 

VB3/SR215 



CDS/SR212 



TCR- 

VB5/SR216 



CD8/SR212 



TCR- 

VB8/SR217 



CD8/SR212 



2 populations 
CD25+8- 
CD25-S+ 

3 populations 
TCRVB3+8+ 
TCRVB3-8+ 
TCRVB3+8- 
3 populations 
TCRVB5+8+ 
TCRVB5-8-^ 
TCRVB5+8- 



! Use Ivsed (1 Ox) 



1 
I 



Use [ysed(lOx) 



NKBI/SR37 
5 




CD4/SR051 



NKB1/SR37 
5 



CDS/SR212 



3 populations 
TCRVB8+8- 
TCRVB8-8-^ 
TCRVB8+S 
3 populations 
NKBl+4-r 

NlCBl+4- 
KKBl-4- 



j Use lysed (lOx) 



3 populations 
KKB1+8+ 
NKBl-8- 
NKB1-8+ 
3 populations 
CD5-19- 
CD5+19- 
CD5-19- 
3 populaiions 
CD6-19- 
CD6-19-^ 
CD6-19- 



Use Ivsed (lOx) 
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040 



041 



TABLE 5 (CONTINUED) 



CD27.SR162 | CD19/SR050 



033 CD2/SR306 ! CD19ySR050 



043 



034 CDS0/SR22S CD 1 9/SR050 



3 populations 

CD27-19+ 

CD27+19+ 

CD27-^19- 

3 populations 

CD2-i-19- 

CD2-19v 



Stanford study 




044 



035 CDS6/SR236 CD19/SR050 



2 populations 
CDSO+19-^ 
CDS0-CD19+ 



Use lysed(lO\) 



6 CD25/SR231 CD19/SR050 



2 populations 

CD86+19^ 

CDS6-19+ 



Use lysed (lOx) 



037 ■ CD69/SR099 CD 1 9/SR050 



CD62USR2 
27 

CD45RA/SR 
346 



HLA 

PAN/SR229 

HLA 
2DR/SR230 



042 I HLA 

" DP/SR370 



HLA 

PANVSR229 



HLA 
2DR'SR230 



f 045 i HLA 

1 DP/SR370 



CD20/SR224 
CD20/SR224 



2 populations 

CD25+19-^ 

CD25-19- 



Use ivsed(lOx) 



2 populations 

CD69+19+ 

CD69-19+ 



CD20/SR224 



1 population 
CD62L+20+ 

2 populations 
CD45RA+20-^ 
CD45RA+20- 



Use lysed (I Ox) 



2 populations 
HLA? AN 11+20+ 
HLA? AN 11+2 0- 
2 populations 
HLADR+20+ 
HLADR+20- 



Pro-5002 



CD20/SR224 



CD4/SR051 



2 populations 
HLADP+20-r 
HLADP+20- 



3 populations 
HLAPAN+4- 
HLAPAN+4- 
HLAPAN-4- 



Pro-5002 



CD4/SR051 



\ 3 populations 
' HLADR-4-r 
HLADR+4- 
HL.ADR-4+ 



Pro-5002 



CD4/SR051 



3 populations 
HLADP+4-^ 
HL.ADP+4- 
HL.ADP-4- 
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046 



TABL E 

CD33 SR232 ! CD14/S R3 66 



04 



- ! CD33. SR232 I CD4/SR051 



5 (CONTINUED) 
4 populations 
CD33-14-^ 
CD33-^14^iotal 
CD33dulll4H- 
CD33^14 
3 populations 
CD33'-4- 
CD33-4- 
CD33-4- 



Good crosscheck on 
CD33 

use doped down 
! CD14 

Pro-5002 
Second check on 
\ CD33 



CD16''SR065 I CD14/S R3 66 




049 



CD64/SRIS2 



CD4/SR051 



5 populaiions 
CD16-i-14+bright 
CD16^l4+duU 
CD16-!-l4+total 
CD16-14-e 
CD16-^14- 



050 



CD64/SRIS2 CD16/SR072 



2 populaiions 

CD64+4-H 

CD64-4+ 



051 



CD45RAySR 
346 



CD14/SR366 



3 populations 
CD64+16+ 
CD64+16- 
CD64-16+ 



Stanford study 



052 
053 



CD62L/SR2 
27 



CD14/SR366 



2 populations 

CD45RA+14+ 

CD45RA+14- 



054 



CDS6/SR236 CD14/SR366 



1 population 
CD62L+14+ 



CD45/SR132 CD14/SR366 



1 population 
CDS6+14+ 



Doped down CD 14 



Doped down CD 14 



Doped down CD 14 



055 



I CD45/SR132 j CD16/SR072 



4 populations 

CD45+14-total 

CD45brightl4- 

CD45dulll4- 

CD45+14-H 



Nice breakdown ol 
lymphs, grans, nnono 
Stanford study 
Lvsed, 1:4 diluted 
(0.5x) 



3 populations + 
total CD45^ 
CD45-16-^hi_sl 
CD45-M6-lo_sl 
CD45-16- 



Pro-5002 



056 :CD15/SR195 CD16/SR072 



2 populations 

CD15-16- 

CD15-16- 



1:4 diluted blood 



057 , CDIS/SR374 j CD15/SR372 



2 populations 
CDlS-15- 
! CD1S^15- 



' 1:4 diluted blood 



; 1:4 diluted blood j 
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058 1 CD45/SR132 i CD14/SR366 

1 

• 

! 

1 i : 


4 populations | 
CD45-^14-iotal ' 
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CD45-H14- 


Nice breakdown of 
lymphs, grans, mono 
Stanford study 
1:4 diluted blood 


1 059 CDnb/SR06 
1 

i ^ 


CD15/SR372 


2 populations 
CDllb+15-r 

CDllb-l5- 


1 :4 diluted blood 

( 


060 ICD32/SR1S0 

r 


CD15/SR372 

f 

1 


j 2 populations 
CD32+15- 
CD32+15- 


! 1:4 diluted blood 

\ 

1 . — — ■ — 
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Soluble Factor Immunoassays 



No 
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11 



12 



13 



14 



15 
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17 



IS 



21 



ASSAY 



IL-1 alpha 



IL-l bcia 



IL-lra 



IL-lsRl 



IL-lsRIl 



lL-6 



IL-S 



IL-IO 



RF (all Isotypes) 



RF IgM 



RF IgG 



RF IcA 



CRP 



SAA 



MMP-3 



MMP-9 



TIMP-1 



TNF alpha 



INF camma 
TGF beta 
sCDG2E 
SCD62P 



No 



ASSAY 



1 --^ 


lL-2 


24 


IL-3 


1 -5 


IL-4 




IL-5 1 




MMP- 1 


28 


MMP-2 


29 


MMP13 1 


30 


TlMP-2 


31 


TIMPo j 


32 


sCD44 


33 


ScD54 lCAM-1 




SCD62L 


35 


RANTES 


36-51 


Immunoglobulin H and L 1 




isotypes (16 assays) 



S ~ soluble 
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" j : I !Cv7- , j 

ASV^^-149rvS C D45 ISR712 'cv5.5 tCDM jS RSO? l\?C ICDlO ISR4 

— ' — '-^-^\ T * ■ !Cy^ 

. Ur^.,; :cn>-?i-t l-. N ^ irni-i isRSO:^ilAPC CD16 SR4 



(WB 
133 b-25\ 



ASy 3-132 lC:v5:CD45 iSR71 



2 !Cv5.5 ICDU 1SR503 t 



IAPC ICDIO 



Lyscd 
SR433 0.25\ 



T Cells (all or both 4 and 8) 



ASV3-001Cv5 



CD4 ISR349 |Cv5.5 CDS 



A$Y 3-l50Cy5 ICD2 

il 

ASY3-066 k:v5 iTCRaP 

Cv5 rrcRgp 



SR306 



SR660 
SR660 



•v5.5 1CD4 



■Cy7- j 

5;R212 IAPC lCD3 SR435 NVB 



jCy7- 

ISR506 lAPC CDS 



SR529 WB 



jCy7- 

1SR663 IaPC CDl 
jCy7- 

|CD4 jSR506 kPC 




ASY3-l42iCy5 



lASV3-145iCv5 



CD44 ISR55S 
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WB- 
SR529 i0.25\ 



BNSDOCIO <WO.. 0065472A1J > 



wo 00/65472 



PCT/USOO/1 1296 



TABLE 7 (CONTINUED) 



! ! I 1 I ' ' ' 

Dve kntigeu iSR?^## iDvc lAntigen {SRUUU IDxe [.Antiocn 'gR### iFormatt 

CD-4 T Cells — — 1 p 
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K^U-t 1 v,cii> _ ; i 1 r 

^ — j I t ' ^ 

ASV^^-056iCvS rn62 L !SR227 fvS.S ^CDA5\L\ iSR453 l\?C f.DA |SPo30 (WB 

1 I T !Cv7- ; ! i 



ASY3-004 



Cv5 CD4 



ASV3-153 



ASV3-154|Cy5 'k:D57 |$R342 |cy5.5 



SR349 Cv5.5 fcD27 



I ICy 
SR161 lAJPC ICD3 



1 

|SR435 KVB 



53|cv5 iCD26 ISR363 |cy5.5 [CD 



Cv7- 

SR506 lAPC |CD3 



tD4 



Cy7- 
LSR506 IaPC 



SR435 



\VB 



ICD 



3 lsR435 
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i 
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ksv3- 


1 

CD5 5 


t 

iR297 ( 


i 

~v5.5 CD 19 :SR050 


Cy7- 
APC 


CD20 


1 \ 
SR71S ■ \VB 


ASV.- 
161 ( 


-^vsi rD72 i; 


iRlOO < 


1 
1 

:v5.5 


1 

CD19 : 


5R050 


Cy7- 
APC 


CD20 


SR71S i 


1 

WB 




ASV3- 
162 < 




I 

CDSO iSR22S < 


::y5.5 


CDS6 


SR706 


Cy7- 
APC ' 


L 

CD20 


SR71S 


WB 


ASY3- 
163 H 


L\5 


CD69 , 


SR235 < 


Cv5.5 


CD71 1 


SR655 


Cv7- ! 

AJ^C 1 CD20 


SR729 1 
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B Cell and 

ASY3- 
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IMoi 
Cv5 


10 
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SR370 
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t 
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i . 
SR718 i WB 
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i 
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SR500 


Cv5.5 


CD 14 


SR503 
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CD20 1 
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SR230 


" ' i 

Cv5.5i CD14 iSR503 
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CD 20 
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WB i 
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N 


SR229 
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WB 
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SRI 79 
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CD45RA 


SR453 
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WB 
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Cy5 


CD40 


SR634 


Cy5.5 


CD14 


SR503 


Cy/- 
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SR718 


WB 


ASY3- 
170 
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Cy5 


CD62L 


SR227 


CyS.S 
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SR5Q3 


Cy7- 
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CD20 


SR729 


WB 


ASY3- 
171 


Cy5 


CD33 


SR232 


Cy5,5 


CD14 


SR503 


Cy7- 
APC 


CD4 


SR530 


WB 


ASY3- 
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Cv5 
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SR349 
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CDHb 


SR371 
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CD 1 6b 


SR359 
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1 
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■ ^ IB 
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SR433 


WB- 
0.25x 




ASY3- 
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Cy: 


;! CD32 


SR37S 


) CV5.5I CD15 


1 

SR372 


Cy7- ' 

APC i CD16 


SR433 


WB- 

0.25X 


i 

i 

t 
1 


ASY3- 




> CD18 


1SR37^ 


— . , — , 

1 

1 Cv5.5f CDllb 


SR371 


Cy7- 
i APC 


CDt6 


! SR433 


WB- 
0-25x 




ASY3- . 
17a iCv: 


>! CD44 


SR55SiCv5. 


i 

5 CD15 iSR37: 


Cy7- 
l APC 


; CD14 


SR719 


WB' 
0.25x 




Asv:- 

175 


Cv. 


5I CDS9 


SR65! 


5 Cv5.5^ CD15 iSR37: 


Cy7- i 
I APC 1 CD14 


; WB- 
SR719 ! 0.25x 


i 

t 
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Cv5 l CD123 , SR289 
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SR704 
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1 DA\ 
1 Fri 


DAY 
Mon 
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1 DAY 1 
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DAY [ 
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Thurs 


DAY 1 
Fri 


DAY 
Thurs I 


1 -3 


0 


11 




-* 

St 


4 


10 


1 Screeninu 








i 




Aspirin 


1 Aspirin 


Aspirin 


Aspirin 




} 


Blood 


1 Blood 


1 


Blood 


1 Blood 


! Blood 


1 Blood 



Blood draw between 8 and 9 am each day 
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=^7^^ ' IChl \Ch2 


. - - 


t 






\ssaY 1 


jye 1/ 
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i 
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I 


Vntigc i 


^muu pyc > 


Vntigenji 




^orni ! 
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1 i 


\SY3- ( 
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( 


SR67 j( 
) 


ry5.5 
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3R67S Cy7 
APC 
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SR641 ^ 




1 

1 
1 


2 . 


ASY3- < 
103 


Cy5 i 


::D42a : 


3R68 


Cy5.5 1 


[:D41a 


SR6S3 Cy7 
APC 


CD61 


SR641 ' 




3 . 


ASY3- 
104 


Cy5 < 


CD42a 


SR68 
4 


Cy5.5 


CD62p 


SR590 Cy7 
APC 


CD61 


SR641 


WB 


4 


ASY3- 
105 


Cy5 


CD42b 


SR68 
5 


Cy5.5 


CD4la 


SR683 Cy7 
APC 


CD6i 


SR641 


WB 


5 


ASY3- 
106 


Cy5 


CD42b 


SR68 

5 . 


Cy5.5 


CD62p 


SR590 Cy7 
APC 


CD61 


SR641 


WB 


6 


ASY3- 
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Cy5 


CD62p 


SR68 
6 


Cy5.5 


CD61 


SR6S1 Cy7 
APC 


CD4la 


SR640 


WB 


7 


ASY3- 
108 


Cy5 


CD63 


SR68 
7 


Cy5.5 


CD61 


SR681 Cy7 
APC 


CD41a 


SR640 


WB 


8 


ASY3- 
109 


Cy5 
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SR67 
3 


Cy5.5 
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Ly7 
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CD61 


SR641 


WB 


9 


ASY3- 
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CD29 
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0 


Cy5.5 
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SR678 


Cy7 
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CD41a 


SR640 

1 


WB 


10 
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SR68 
6 
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SR6S1 
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CD41a 

i 


SR640 
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t 
\ 


11 


ASY3- 
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SR6S 
7 
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CD61 


SR681 
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APC 


CD41a jSR640 

1 

! 


TRT i 


12 


ASY3- 
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PAC-1 


SR67 
3 


Cy5.5 


CD9 !SR678 
1 


Cy7 
APC 


CD61 jSR641 


TRT : 


13 

i 


ASY3- 
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iCyS 

i 


;CD42b 


SR6S iCy5.5!CD62plSR590 

is i . 


iCy7 
APC 


CD61 ISR641 

i 

— i — 


TRT 


14 


ASY3- 
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Cy5 

1 
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SR68 |Cy5.: 
6 I 


) CD61 jSRGSl 


Cy7 
APC 


CD41a SR640 INIKi 
1 1 
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Claims 

--v 1 . A biological marker identification system comprising: 

a) an integrated database comprising a plurality of data categories, said data 

categories comprising. 

5 i) levels of a plurality of cell populations and/or cell associated molecules 

in the biological fluid of an organism, and/or levels of a plurality of soluble factors in the 

biological fluid of an organism, and 

ii) information associated with a plurality of clinical parameters of an 

organism; 

10 b) data from a plurality oforganisms corresponding to said data categories; and 

i) processing means for correlating data within the data categories, 
wherein correlation analysis of data categories can be made to identify the data category or 
categories indicating normal biologic processes, pathogenic processes, or pharmacological 
responses to therapeutic intervention, 

1 5 wherein said identified category or categories are biological markers. 

2. The biological marker identification system of claim 1 wherein said data for 

^ levels of cell populations and/or cell associated molecules are obtained by microvolume laser 
scanning cytometry. 

3. The biological marker identification system of claims 1 and 2 comprising at 
20 least 20 cell population and/or cell associated molecules level data categories. 

4. The biological marker identification system of claim 3 comprising at least 30 
cell population and/or cell associated molecules level data categories. 

5. The biological marker identification system of claim 3 comprising at least 40 
cell population and/or cell associated molecules level data categories. 

25 6. The biological marker identification system of claims 1-3 wherein the soluble 

factor is a soluble protein. 

7. The biological marker identification system of claim I wherein the soluble 

factor is a small molecule. 

8. The biological marker identification system of claim I wherein said data for 
30 levels of soluble factors are obtained by microvolume laser scanning cytometry. 
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9. The biological marker identification system of claim 1 wherem said data for 
levels of soluble factors are obtained by immunoassays. 

1 0. The biological marker identification system of claim 1 comprising at least 20 

soluble factor level data categories. 

1 1 . The biological marker identification system of claim 1 0 comprising at least 30 

soluble factor level data categories. 

12. The biological marker identification system of claim 10 comprising at least 40 

soluble factor level data categories. 

13. The biological marker identification system of claim 1 wherein data from at 

least some of said organisms is included a plurality of times. 

14. The biological marker identification system of claim I wherein said data 

categories further include: 

iii) genotype information associated with an organism. 

15. The biological marker identification system of claim 1 wherein said data for 
levels of soluble factors are obtained by mass spectrometry. 

16. The biological marker identification system of claim 1 wherein said 
infomiation associated with said clinical parameters is selected from the group consisting of 
age, gender, weight, height, body type, medical history, family history, environmental factors 
and manifestation and categorization of disease or medical condition. 

17. The biological marker identification system of claim 1 wherein said data is 
obtained from organisms prior to and after the administration of a therapeutic treatment. 

1 8. The biological marker identification system of claim 1 wherein at least some of 
said data is obtained from an organism having been previously diagnosed as having a 
predetermined disease or medical condition. 

19. The biological marker identification system of claim 1 wherein at least some of 
said data is obtained at a plurality of times from an organism having been previously , 
diagnosed as having a predetermined disease or medical condition. 

20. The biological marker identification system of claims 18 and 19 wherein said 
predetermined disease or medical condition is rheumatoid arthritis. 
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21. The biological marker identification system of claims 1 8 and 1 9 wherein said 
predetemuned disease or^med.cal condition is selected from the group consisting of 
rheumatoid arthritis, asthma, allergy and multiple sclerosis. 

The bioloeical marker identification system of claim I wherein said data 
categon'es comprise levels of a plurality of cell populations and/or celt associated molecules m 
the biological nuid of an organism and levels of a plurality of soluble factors m the biological 
fluid of an organism. 

23. A method for identifying a biological marker for a given disease or medical 

condition comprising: 

correlating infomiation obtained from a plurality of organisms, at least some of said 
orgamsms having said disease or medical condition, wherein information is associated with a 
plurality of data categories, and wherein said data categories compnse. 

i) levels of a plurality of cell populations and/or cell associated molecules in the 
biological nuid of an organism, and/or levels of a plurality of soluble factors in the biological 

1 5 fluid of an organism, and 

ii) information associated with a plurality of clinical parameters of an organism; 

identifying a data category where organisms having said disease or medical condition may be 
differentiated from those organisms not having said disease or medical condition, wherein said 
identified category is a biological marker for said disease. 
20 24 The method for identifying a biological marker of claim 23 wherein said data 

for levels of cell populations and/or cell associated molecules are obtained by microvolume 

laser scanning cytometry. 

25. The method for identifying a biological marker of claim 23 comprising at least 

20 cell population and/or cell associated molecules level data categories. 
25 26. The method for identifying a biological marker of claim 25 comprising at least 

30 cell population and/or cell associated molecules level data categories. 

27. The method for identifying a biological marker of claim 25 comprising at least 
40 cell population and/or cell associated molecules level data categories. 

28. The method for identifying a biological marker of claim 23 wherein said data 
30 for levels of soluble factors are obtained by microvolume laser scanning cytometr^•. 
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29. The method for identifying a biological marker of claim 23 comprising at least 

20 soluble factor level data categories. 

30. The method for identifying a biological marker of claim 29 comprising at least 

30 soluble factor level data categories. 
5 31 . The method for identifying a biological marker of claim 29 comprising at least 

40 soluble factor level data categories. 

32. The method for identifying a biological marker of claim 23 wherein said data 

categories further include; 

iii) genotype infomiation associated with any organism. 
10 33. The method for identifying a biological marker of claim 23 wherein said data 

for levels of soluble factors are obtained by mass spectrometry'. 

34. The method for identifying a biological maker of claim 23 wherein said data for 
levels of soluble factors are obtained by immunoassays. 

35. The method for identifying a biological marker of claim 23 wherein said 

1 5 information associated with said clinical parameters is selected from the group consisting of 

age. gender, weight, height, body type, medical history, family histoiy, environmental factors . 
and manifestation and categoriization of disease or medical condition. 

36. The method for identifying a biological marker of claim 23 wherein said 

disease is rheumatoid arthritis. 
20 37. The method for identifying a biological marker of claim 23 wherein said 

disease is selected from the group consisting of rheumatoid arthritis, asthma, allergy and 
multiple sclerosis. 

38. A phenotype of an organism comprising a plurality of biological parameters 
comprising: 

25 i) the results of at least 20 assays relating to cell populations and/or cell 

associated molecules; 

ii) the results of at least 20 assays relating to soluble factors; and 

iii) clinical parameters. 

39. The phenotype of claim 38 comprising the results of at least 40 assays 
30 relating to cell populations andor cell associated molecules. 
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40. The phenotype of claim 3S comprising the results of at least 40 assays 

relating to soluble factors. 

41 . The phenotype of claim 3S further comprising genotype information of said 

organism. 

5 42. A phenotype of a class or subclass of organisms comprising a plurality of 

biological parameters from a plurality of members of said class or subclass; wherein from 
each said member said biological parameters comprise: 

i) the results of at least 20 assays relating to cell populations and/or cell 

associated molecules; 
IQ ii) the results of at least 20 assays relating to soluble factors; and 

iii) clinical parameters. 

43 . The phenotype of claim 42 comprising the results of at least 40 assays 

relating to cell populations and/or cell associated molecules. 

44. The phenotype ofclaim 42 comprising the results of at least 40 assays 

15 relating to soluble factors. 

45. The phenotype ofclaim 42 ftirther comprising genotype information of each 

-) ■ " 
said member. 

46. A system for creating the phenotype of an organism comprising: 

i) obtaining biological parameters from said organism compnsmg: 
20 a) the results of at least 20 assays relating to cell populations and/or cell 

associated molecules; 

b) the results of at least 20 assays relating to soluble factors; and 

c) clinical parameters; and 

ii) entering said biological parameters into an integrated data base. 
25 47. The system ofclaim 46 composing the results of at least 40 assays relatmg 

to cell populations and/or cell associated molecules. 

48. The system ofclaim 46 composing the results of at least 40 assays relating 

to soluble factors. 

49. The system ofclaim 46 wherein said biological parameters further compose 
30 genotype information. 
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comprising: 



S6 

A method for evaluating the effect of a perturbation on an orgamsm 

^ng the phenot>Te of said orgamsm prior to and after said perturbation; 



i) obtami! 
and 

5 ii ) comparing the information in said prior to and after phenotypes to identify 

changed parameters: 

wherein said phenotypes are comprised of: 

a) the results of at least 20 assays relating to cell populations and/or cell 

associated molecules; 

,Q b) the results of at least 20 assays relating to soluble factors: and 

c) clinical parameters. 

51. ' The method ofclaim 50, wherein said phenotypes compose at least 40 

assays relating to cell populations and/or cell associated molecules. 

52. The method ofclaim 50, wherein said phenotypes comprise at least 40 

15 assays relating to soluble factors. 

53. The method of claim 50. wherein said phenotypes further comprise genotype 

infonnation of said organism. 

54. A method for evaluating the effect of a perturbation on a class or subclass of 

organisms comprising: 

20 i) obtaining the phenotype of a plurality of members of said class or subclass of 

organisms prior to and after said perturbation: 

,i) companng the information m said poor to and after phenotype to identify 

changed parameters; 

wherein said phenotypes are compnsed of: 
,3 a) the results of at least 20 assays relating to cell populations and/or cell 

associated molecules; 

b) the results of at least 20 assays relatmg to soluble factors; and 

c) clinical parameters. 

55. A method for evaluating the effect of a perturbation on an organism or class 

30 or subclass or organisms comprising: 
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' ^ effected by said perturbation and the phcnotype of one or more of said organisms who have 

been effected bv said perturbation: and 

ii) ' companng the mformation .n the phenotypcs of said plurahty of organ.sms 
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who hav e not been effected by said perturbation with the pheuot^T^e of the one or more 
organisms who have been effected by said penurbat.on to identify changed parameters; 
wherein said phenotypes are comprised of: 

a) the results of at least 20 assays relating to cell populations and/or cell 

associated molecules; 

b) the results of at least 20 assays relating to soluble factors; and 

c) clinical parameters. 

56. A system for the identification of biological markers of a disease or medical 

condition in an animal model of said disease or medical condition comprising: 

a) an integrated database comprising a plurality of data categories, said data 

15 categories comprising, 

i) levels of a plurality of cell populations and/or cell associated molecules m the 
biological fluid of an animal, and/or levels of a plurality of soluble factors in the biological 

fluid of an animal, and 

ii) information associated with a plurality of physical parameters of an ammal; 
.0 b) data from a plurality of animals corresponding to said data categories; and 

i) processing means for correlating data within the data caiegoncs, wherem 
correlation analysis of data categories can be made to idem.fy the data category or categories 
indicating normal biologic processes, pathogenic processes, or pharmacological responses to 

candidate therapeutic inier%'ention; 
•75 wherein said identified category or categories are biological markers. 

57. The biological marker idemif.cation system of claim 56 wherein said data for 
levels of cell populations^and/or cell associated molecules are obtained by microvolume laser 

scannins cytometry. 

5S. The biological marker idemif.canon system of claims 56 and 57 composing at 
30 least 20 cell population and/or cell associated molecules level data categories. 
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59. The biological marker identification system of claim 5S comprising at least 40 
cell population and/or cell associated molecules level data categories. 

60. The biological marker identification system of claim 56 comprising at least 20 

soluble factor level data categories. 

6 1 The biological marker identification system of claim 56 compnsing at least 40 

soluble factor level data categories. 

62. The biological marker identification system of claim 56 wherein said data 

categories further include: 

genotype information associated with an animal. 

63. A method for identifying a biological marker for a given disease or medical 
condition in an animal model of said disease or medical condition comprising: 

providing an animal model of said disease or medical condition; 

correlating information obtained from a plurality of individual animals, at least some of 
said individual animals having said disease or medical condition, wherein information is 
i associated with a plurality of data categories, and wherein said data categories comprise, 

i) levels of a plurality of cell populations and/or cell associated molecules in the 
biological fiuid of an individual, and/or levels of a plurality of soluble factors in the biological 

fluid of an individual animal; and 

ii) information associated with a plurality of physical parameters of an individual 

0 animal; 

identifying a data category where individual animals having said disease or medical 
condition may be differentiated from those individual animals not having said disease or 
medical condition, wherein said identified category is a biological marker for said disease in 
said animal model. 

5 64. A method for identifying a biological marker for a given disease or medical in 

humans, comprising: 

providing an animal model of said disease or medical condition; 

identifying a biological marker for said disease or medical condition in the animal 
model of said disease or medical condition according to the method of claim 63; and 
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detemiinine if said biological marker is diagnostic or prognostic of said disease or 

medical condition in humans. 

65. A method for assaying a candidate therapeutic agent directed against a human 

disease or medical condition, the method comprising: 

providing an animal model of said disease or medical condition; 

identifying at least one biological marker of said disease or medical condition in said 

animal model by the method of claim 63; 

treaiins said animal model with said candidate therapeutic: and 
monitoring the response of said biological markers in said animal model. 

66. A method for monitoring the results of a clinical study in humans with a given 
medical disease or condition comprising: 

evaluating biological markers in humans that are homologues of biological markers 
identified in animal models of said medical disease or condition. 

67. A method for designing an improved animal model for a human disease or 

medical condition comprising: 

identifying human biological markers relative to said disease or medical condition; 

tailoring the animal model to more accurately simulate said disease or medical 
condition by elevating or reducing the levels of the animal homologues of said human 
biological marker. 

68. A method for identifying an animal model of a disease or medical condition 
comprising: 

i) obtaining the phenotype of a plurality of potential animal models of said 

disease or medical condition; 

ii) obtaining the phenotype of organism having said disease or medical condition; 

iii) comparing the potential animal model phenotypes with the phenotype of the 
organisms havin^ said disease or medical condition to identify the animal model phenot>T3e 
that most closely simulates the phenotype of the organisms having said disease or medical 
condition; 

wherein said phenotvpes are comprised of: 



<WO 0C)65't72AlJ > 



wo 0O/6S472 



PCT/US00/n296 



90 

a) the results of at least 20 assays relating to cell populations and/or cell 

associated molecules: 

b) the results of at least 20 assays relating to soluble factors; and 

c) clinical parameters. 

5 69. The phenotype of claim 38 wherein said organism is selected from the group 

consisting of a human, an animal, a plant, and a virus. 

70. The phenotype of claim 42 wherein said class or subclass or orgamsms ,s 
selected from the group consisting of humans, animals, plants and virus. 

71. A method for evaluating the effects of a genetic alteration on a plant or anm.al 

10 comprising: . „ i a 

i) obtammg the phenotype of sa>d plant or an,mal that has been genetically altered 

and the phenotype of the non-genetically altered plant or animal; and 

ii) comparing the information in the genettcally-altered and non-genet.cally altered 

phenotype to identify changed parameters; 

1 5 -wherein said phenotypes are comprised of: 

a) the results of at least 20 assays relating to cell populations and/or cell 

associated molecules; 

b) the results of at least 20 assays relating to soluble factors; and 

c) clinical parameters. 

20 
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